Client Presentation · July 2026

VoiceForm AI

Tamil AI Voice Assistant & Intelligent Form Filling

Speak Transcribe Auto-fill Submit

Phase
Phase 1 MVP Delivered
Languages
Tamil · English · Malayalam
Stack
React · FastAPI · PostgreSQL
Prepared by
Wizbees
Overview

Presentation Agenda

A complete walkthrough of VoiceForm AI — from problem to pilot roadmap.

01 Executive Summary
02 Problem & Use Cases
03 Live Voice Example
04 Phase 1 Delivered
05 Form Builder
06 User Personas
07 Business Benefits
08 Product Flow
09 Technology Stack
10 Architecture
11 Security
12 Admin Analytics
13 Phase 2 Plan
14 Roadmap Timeline
15 Next Steps
Executive Summary

One Platform, Four Capabilities

VoiceForm AI unifies voice capture, form management, speech-to-form mapping, and operational intelligence in a single Tamil-first stack.

0
API Endpoints
0
Form Field Types
0
Voice Languages
0
Automated Tests

What It Does

Lets people fill forms by speaking instead of typing. Admins design forms, publish public URLs, and review submissions with full voice pipeline analytics.

Who It Serves

Schools, hospitals, government offices, HR teams, and field survey organizations that need hands-free Tamil data entry with admin oversight.

The Challenge

The Problem We Solve

Organizations lose time, accuracy, and completion rates when data entry depends on typing — especially in Tamil.

Slow Manual Data Entry

Long forms on mobile are painful to type. Field workers abandon forms before finishing.

Weak Tamil Support

Generic form tools lack Tamil speech recognition and culturally appropriate field parsing.

Opaque AI Costs

STT and LLM usage is billed per call with no per-form or per-request cost breakdown.

Fragmented Tooling

Separate products for forms, transcription, and analytics increase integration cost.

Real-World Examples

Industry Use Cases

Concrete scenarios where voice form filling saves time and improves accuracy.

Education

Student Admission Form

Form: student-admission · 12 fields

Parent speaks student details in Tamil during school admission drive. No typing on mobile — review and submit in under 2 minutes.

"என் மகன் பெயர் அருண், வயது 10, சென்னை..."
Healthcare

Patient Intake Form

Form: patient-intake · 15 fields

Nurse records patient symptoms and history by voice at reception. Auto-fills name, age, phone, and complaint fields for doctor review.

"நோயாளி பெயர் மீனா, வயது 45, காய்ச்சல் மற்றும் தலைவலி..."
Government

Citizen Service Request

Form: service-request · 10 fields

Field officer captures citizen grievance details hands-free during door-to-door surveys. Works offline with Whisper, syncs when connected.

"பெயர் கணேசன், ஊர் திருச்சி, பிரச்சினை தண்ணீர் வழங்கல்..."
HR & Payroll

Employee Onboarding

Form: employee-onboard · 18 fields

HR speaks new hire details during orientation. Gemini AI extracts name, department, salary grade, and start date from natural speech.

"புதிய ஊழியர் ராஜேஷ், துறை IT, சம்பளம் 45000, தொடக்க தேதி..."
Live Example

Tamil Voice → Auto-Filled Form

Student admission form — speak once, fields populate automatically.

Recording & Transcript
Auto-Filled Fields
Phase 1 — Delivered

What We Built

A complete Tamil-first voice form platform — ready to demo and pilot today.

Voice Capture

Browser recording and file upload with automatic background transcription.

Form Builder

Drag-and-drop designer with 17 field types. Publish forms via public URL slug.

Public Voice Forms

Respondents speak or upload audio; fields auto-fill for review before submit.

Speech-to-Text

Local Whisper ASR or Google Cloud STT (Chirp 2) — per-request provider choice.

Smart Autofill

Rule-based NLP or Gemini 2.5 Flash AI — with automatic fallback to rules.

Admin Voice Reports

KPIs, daily cost rollups, pipeline history, audio replay, and CSV export.

Form Designer

Drag-and-Drop Form Builder

17 field types across 4 groups — design once, publish via public URL slug.

Basic Controls
Text BoxText AreaNumber EmailPhonePassword
Selection Controls
DropdownRadioCheckboxToggle
Date & Time
Date PickerTime Picker
Advanced
File UploadAudio RecordImage SignatureVoice Summary

Admin Workflow

  • Drag fields onto canvas in visual designer
  • Set labels, validation, required flags, aliases
  • Choose autofill mode: rules or Gemini AI
  • Publish form — get shareable URL: /forms/student-admission
  • Export responses as CSV from admin panel
  • View audit log of all admin actions
Users

Who Uses VoiceForm AI

Four personas — each with a dedicated entry point and workflow.

C

Field Agent / Citizen

Public respondent

Opens published form URL, speaks answers, reviews auto-filled fields, submits — no login required.

K

Knowledge Worker

Authenticated user

Records voice notes, manages recordings, searches transcripts, favorites and archives notes.

A

Form Administrator

Admin role

Builds forms in designer, manages users, exports responses, configures autofill and STT provider.

O

Operations Lead

Admin role

Reviews Voice Reports KPIs, daily AI cost rollups, replays audio, audits pipeline history.

Value Proposition

Business Benefits

Measurable value for field teams, administrators, and operations leads.

Faster Data Capture

Voice is quicker than typing — especially for Tamil and long narrative answers.

Higher Completion Rates

Auto-fill reduces friction; users only correct mismatches before submitting.

Cost Transparency

Estimated STT and Gemini USD per voice record — no surprise cloud bills during pilots.

Improvement Loop

Transcripts, corrections, and audio paths support tuning parsers and models over time.

Flexible Deployment

Local Whisper avoids cloud STT cost in dev; Google STT for production quality.

Single Integrated Stack

One codebase for forms, voice, and admin — lower integration overhead.

Product Flow

How It Works

From spoken Tamil to structured data in five simple steps.

Ideal for organizations that need hands-free, Tamil-first data entry

Schools Hospitals Government HR & Payroll Field Surveys Customer Support
Technology

Enterprise Technology Stack

Modern, production-grade components — self-hosted and cloud-ready.

LayerTechnologyPurpose
FrontendReact 19, TypeScript, Vite, MUI, TailwindSingle-page app, form builder UI
BackendFastAPI, Python 3.11+, SQLAlchemy 2REST API, async voice pipeline
DatabasePostgreSQL 16 + pgvectorStructured data, future semantic search
Speech-to-TextWhisper (local) + Google Chirp 2Tamil / English / Malayalam ASR
AI AutofillGemini 2.5 Flash + rule-based NLPMap transcript text to form fields
AuthJWT access + refresh, bcrypt, RBACSecure admin and user sessions
:4325
Frontend Port
:8236
API Port
:8235
Whisper Port
57+
API Endpoints
Architecture

System Architecture

Four integrated layers — from browser to database and AI services.

UI

Presentation — React SPA

Form builder, public voice forms, admin panel, Voice Reports dashboard

API

Application — FastAPI Backend

Auth, forms CRUD, STT orchestration, NLP autofill, Gemini client, cost estimator

AI

AI & Voice Services

Whisper microservice (port 8235), Google Cloud STT, Gemini 2.5 Flash autofill

DB

Data — PostgreSQL 16

Users, forms, responses, voice dataset records, audit logs, Alembic migrations

Security

Security & Governance

Enterprise controls built into Phase 1 — with Phase 2 hardening planned.

JWT Authentication

Access + refresh tokens, bcrypt password hashing, automatic token refresh in frontend.

Role-Based Access

Admin vs user roles. API 403 + UI route guards prevent unauthorized admin access.

Audit Logging

Every admin action logged — who did what, when. Full trail for compliance review.

Phase 2: Rate Limiting

Protect public voice endpoints from abuse. Optional PIN/password on public forms.

Phase 2: MFA for Admin

Multi-factor authentication for admin accounts. Encryption at rest for uploads.

Phase 2: WAF / DDoS

Production edge security with Azure WAF. GDPR data export and deletion tools.

Analytics

Admin Voice Reports

Full operational visibility — every voice interaction logged with estimated costs.

Rules Autofill AccuracyTarget ≥ 80%
Form Completion RatePilot target 90%
Tamil Transcription QualityManual QA pass
Phase 1 Test Coverage25+ pytest tests

Voice Reports Dashboard

  • KPI cards: total voice records, STT calls, Gemini calls
  • Daily cost rollup — estimated USD per STT and Gemini request
  • Per-record history with audio replay and transcript view
  • CSV export for finance and operations review
  • Configurable rates: Google STT $0.016/min, Whisper $0 local
  • INR display via configurable USD→INR rate (default 84.5)
Phase 2 — Planned

Phase 2: Pilot & AI Upgrades

Validate with real users, then enhance accuracy and intelligence.

Pilot Deployment

  • Deploy to staging environment (Azure App Service)
  • UAT walkthrough on your target forms
  • Tune autofill rules for your domain vocabulary
  • Measure completion rate and field accuracy
  • Collect stakeholder sign-off for production

AI Enhancements

  • Improve Tamil form slot filler (rule-based patterns)
  • Google Cloud Speech-to-Text integration (Chirp 2)
  • RAG knowledge-base form filling from uploaded PDFs
  • Semantic search via pgvector (notes and transcripts)
  • AI chat assistant for form guidance
Phase 2 — Platform

Platform & Production Readiness

Harden the platform for real-world deployment and integration.

Cloud Blob Storage

Azure Blob or S3 for scalable audio and file uploads beyond local disk.

Rate Limiting

Protect public voice endpoints from abuse and spam submissions.

CI/CD Pipeline

GitHub Actions for automated test, build, and deploy to staging/production.

PDF / Excel Reports

Rich export formats beyond CSV for management and compliance reporting.

Mobile PWA

Installable progressive web app with offline record queue for field agents.

Webhook Integrations

Push form submissions to CRM, ERP, or custom systems on submit.

Roadmap

Implementation Timeline

Clear phases from MVP delivery through production scale.

Phase 1 — MVP

Core platform delivered. Auth, forms, STT, autofill, Voice Reports.

2

Phase 2 — Pilot

Staging deploy, UAT, autofill tuning. Est. 4–6 weeks.

3

Phase 3 — Production

HTTPS, object storage, monitoring, credential rotation.

4

Phase 4 — Scale

Rate limits, message queues, CDN, load testing.

M7 — Pilot Deployment

Phase 2 milestone. Staging live, UAT complete, metrics validated.

M8 — Production Go-Live

Phase 3 milestone. HTTPS production URL, monitoring, sign-off.

M9 — Scale & Hardening

Phase 4 milestone. Load tested, queue workers, CDN ready.

Success Criteria

Pilot Success Metrics

Measurable targets for UAT sign-off and Phase 2 pilot completion.

MetricTargetHow Measured
Public form voice completionRecord → fill → submit worksUAT walkthrough script
Tamil transcription qualityIntelligible for clear speechManual QA + STT confidence
Rules autofill accuracy≥ 80% labeled fields correcttest_form_auto_fill.py suite
Admin cost visibilityKPIs load without errorVoice Reports page + API
Auth & RBACNon-admin blocked from adminAPI 403 + UI route guard
Pilot deploymentStaging live on AzureM7 milestone sign-off

Before VoiceForm

8–12 min per form on mobile typing. ~40% abandonment on long Tamil forms.

With VoiceForm

2–3 min per form by voice. User only corrects mismatches — submits faster.

Phase 2 Goal

≥ 90% field accuracy on your domain forms after autofill tuning.

Next Steps

Ready to Move Forward?

Three actions to begin your VoiceForm AI pilot.

  1. 1

    Schedule Pilot UAT

    Book a walkthrough session. We demo live voice form filling on your target use cases.

  2. 2

    Provision Staging Environment

    Deploy to Azure App Service (~₹9,720/month production tier, or ~₹2,370 ultra-budget POC).

  3. 3

    Define Target Forms

    Share your form templates (student, employee, survey, etc.) for autofill tuning in Phase 2.