Client Presentation · July 2026

VoiceForm AI

Tamil AI Voice Assistant & Intelligent Form Filling

Speak Transcribe Auto-fill Submit

Phase

Phase 1 MVP Delivered

Languages

Tamil · English · Malayalam

Stack

React · FastAPI · PostgreSQL

Prepared by

Wizbees

Overview

Presentation Agenda

A complete walkthrough of VoiceForm AI — from problem to pilot roadmap.

01 Executive Summary

02 Problem & Use Cases

03 Live Voice Example

04 Phase 1 Delivered

05 Form Builder

06 User Personas

07 Business Benefits

08 Product Flow

09 Technology Stack

10 Architecture

11 Security

12 Admin Analytics

13 Phase 2 Plan

14 Roadmap Timeline

15 Next Steps

Executive Summary

One Platform, Four Capabilities

VoiceForm AI unifies voice capture, form management, speech-to-form mapping, and operational intelligence in a single Tamil-first stack.

0

API Endpoints

0

Form Field Types

0

Voice Languages

0

Automated Tests

What It Does

Lets people fill forms by speaking instead of typing. Admins design forms, publish public URLs, and review submissions with full voice pipeline analytics.

Who It Serves

Schools, hospitals, government offices, HR teams, and field survey organizations that need hands-free Tamil data entry with admin oversight.

The Challenge

The Problem We Solve

Organizations lose time, accuracy, and completion rates when data entry depends on typing — especially in Tamil.

Slow Manual Data Entry

Long forms on mobile are painful to type. Field workers abandon forms before finishing.

Weak Tamil Support

Generic form tools lack Tamil speech recognition and culturally appropriate field parsing.

Opaque AI Costs

STT and LLM usage is billed per call with no per-form or per-request cost breakdown.

Fragmented Tooling

Separate products for forms, transcription, and analytics increase integration cost.

Real-World Examples

Industry Use Cases

Concrete scenarios where voice form filling saves time and improves accuracy.

Education

Student Admission Form

Form: student-admission · 12 fields

Parent speaks student details in Tamil during school admission drive. No typing on mobile — review and submit in under 2 minutes.

"என் மகன் பெயர் அருண், வயது 10, சென்னை..."

Healthcare

Patient Intake Form

Form: patient-intake · 15 fields

Nurse records patient symptoms and history by voice at reception. Auto-fills name, age, phone, and complaint fields for doctor review.

"நோயாளி பெயர் மீனா, வயது 45, காய்ச்சல் மற்றும் தலைவலி..."

Government

Citizen Service Request

Form: service-request · 10 fields

Field officer captures citizen grievance details hands-free during door-to-door surveys. Works offline with Whisper, syncs when connected.

"பெயர் கணேசன், ஊர் திருச்சி, பிரச்சினை தண்ணீர் வழங்கல்..."

HR & Payroll

Employee Onboarding

Form: employee-onboard · 18 fields

HR speaks new hire details during orientation. Gemini AI extracts name, department, salary grade, and start date from natural speech.

"புதிய ஊழியர் ராஜேஷ், துறை IT, சம்பளம் 45000, தொடக்க தேதி..."

Live Example

Tamil Voice → Auto-Filled Form

Student admission form — speak once, fields populate automatically.

Recording & Transcript

Auto-Filled Fields

Student Name

Age

City

Parent Phone

Email

Class Applying

Phase 1 — Delivered

What We Built

A complete Tamil-first voice form platform — ready to demo and pilot today.

Voice Capture

Browser recording and file upload with automatic background transcription.

Form Builder

Drag-and-drop designer with 17 field types. Publish forms via public URL slug.

Public Voice Forms

Respondents speak or upload audio; fields auto-fill for review before submit.

Speech-to-Text

Local Whisper ASR or Google Cloud STT (Chirp 2) — per-request provider choice.

Smart Autofill

Rule-based NLP or Gemini 2.5 Flash AI — with automatic fallback to rules.

Admin Voice Reports

KPIs, daily cost rollups, pipeline history, audio replay, and CSV export.

Form Designer

Drag-and-Drop Form Builder

17 field types across 4 groups — design once, publish via public URL slug.

Basic Controls

Text BoxText AreaNumber EmailPhonePassword

Selection Controls

DropdownRadioCheckboxToggle

Date & Time

Date PickerTime Picker

Advanced

File UploadAudio RecordImage SignatureVoice Summary

Admin Workflow

Drag fields onto canvas in visual designer
Set labels, validation, required flags, aliases
Choose autofill mode: rules or Gemini AI
Publish form — get shareable URL: /forms/student-admission
Export responses as CSV from admin panel
View audit log of all admin actions

Users

Who Uses VoiceForm AI

Four personas — each with a dedicated entry point and workflow.

C

Field Agent / Citizen

Public respondent

Opens published form URL, speaks answers, reviews auto-filled fields, submits — no login required.

K

Knowledge Worker

Authenticated user

Records voice notes, manages recordings, searches transcripts, favorites and archives notes.

A

Form Administrator

Admin role

Builds forms in designer, manages users, exports responses, configures autofill and STT provider.

O

Operations Lead

Admin role

Reviews Voice Reports KPIs, daily AI cost rollups, replays audio, audits pipeline history.

Value Proposition

Business Benefits

Measurable value for field teams, administrators, and operations leads.

Faster Data Capture

Voice is quicker than typing — especially for Tamil and long narrative answers.

Higher Completion Rates

Auto-fill reduces friction; users only correct mismatches before submitting.

Cost Transparency

Estimated STT and Gemini USD per voice record — no surprise cloud bills during pilots.

Improvement Loop

Transcripts, corrections, and audio paths support tuning parsers and models over time.

Flexible Deployment

Local Whisper avoids cloud STT cost in dev; Google STT for production quality.

Single Integrated Stack

One codebase for forms, voice, and admin — lower integration overhead.

Product Flow

How It Works

From spoken Tamil to structured data in five simple steps.

1

Record

2

STT

3

Autofill

4

Review

5

Submit

Ideal for organizations that need hands-free, Tamil-first data entry

Schools Hospitals Government HR & Payroll Field Surveys Customer Support

Technology

Enterprise Technology Stack

Modern, production-grade components — self-hosted and cloud-ready.

Layer	Technology	Purpose
Frontend	React 19, TypeScript, Vite, MUI, Tailwind	Single-page app, form builder UI
Backend	FastAPI, Python 3.11+, SQLAlchemy 2	REST API, async voice pipeline
Database	PostgreSQL 16 + pgvector	Structured data, future semantic search
Speech-to-Text	Whisper (local) + Google Chirp 2	Tamil / English / Malayalam ASR
AI Autofill	Gemini 2.5 Flash + rule-based NLP	Map transcript text to form fields
Auth	JWT access + refresh, bcrypt, RBAC	Secure admin and user sessions

:4325

Frontend Port

:8236

API Port

:8235

Whisper Port

57+

API Endpoints

Architecture

System Architecture

Four integrated layers — from browser to database and AI services.

UI

Presentation — React SPA

Form builder, public voice forms, admin panel, Voice Reports dashboard

API

Application — FastAPI Backend

Auth, forms CRUD, STT orchestration, NLP autofill, Gemini client, cost estimator

AI

AI & Voice Services

Whisper microservice (port 8235), Google Cloud STT, Gemini 2.5 Flash autofill

DB

Data — PostgreSQL 16

Users, forms, responses, voice dataset records, audit logs, Alembic migrations

Security

Security & Governance

Enterprise controls built into Phase 1 — with Phase 2 hardening planned.

JWT Authentication

Access + refresh tokens, bcrypt password hashing, automatic token refresh in frontend.

Role-Based Access

Admin vs user roles. API 403 + UI route guards prevent unauthorized admin access.

Audit Logging

Every admin action logged — who did what, when. Full trail for compliance review.

Phase 2: Rate Limiting

Protect public voice endpoints from abuse. Optional PIN/password on public forms.

Phase 2: MFA for Admin

Multi-factor authentication for admin accounts. Encryption at rest for uploads.

Phase 2: WAF / DDoS

Production edge security with Azure WAF. GDPR data export and deletion tools.

Analytics

Admin Voice Reports

Full operational visibility — every voice interaction logged with estimated costs.

Rules Autofill AccuracyTarget ≥ 80%

Form Completion RatePilot target 90%

Tamil Transcription QualityManual QA pass

Phase 1 Test Coverage25+ pytest tests

Voice Reports Dashboard

KPI cards: total voice records, STT calls, Gemini calls
Daily cost rollup — estimated USD per STT and Gemini request
Per-record history with audio replay and transcript view
CSV export for finance and operations review
Configurable rates: Google STT $0.016/min, Whisper $0 local
INR display via configurable USD→INR rate (default 84.5)

Phase 2 — Planned

Phase 2: Pilot & AI Upgrades

Validate with real users, then enhance accuracy and intelligence.

Pilot Deployment

Deploy to staging environment (Azure App Service)
UAT walkthrough on your target forms
Tune autofill rules for your domain vocabulary
Measure completion rate and field accuracy
Collect stakeholder sign-off for production

AI Enhancements

Improve Tamil form slot filler (rule-based patterns)
Google Cloud Speech-to-Text integration (Chirp 2)
RAG knowledge-base form filling from uploaded PDFs
Semantic search via pgvector (notes and transcripts)
AI chat assistant for form guidance

Phase 2 — Platform

Platform & Production Readiness

Harden the platform for real-world deployment and integration.

Cloud Blob Storage

Azure Blob or S3 for scalable audio and file uploads beyond local disk.

Rate Limiting

Protect public voice endpoints from abuse and spam submissions.

CI/CD Pipeline

GitHub Actions for automated test, build, and deploy to staging/production.

PDF / Excel Reports

Rich export formats beyond CSV for management and compliance reporting.

Mobile PWA

Installable progressive web app with offline record queue for field agents.

Webhook Integrations

Push form submissions to CRM, ERP, or custom systems on submit.

Roadmap

Implementation Timeline

Clear phases from MVP delivery through production scale.

✓

Phase 1 — MVP

Core platform delivered. Auth, forms, STT, autofill, Voice Reports.

2

Phase 2 — Pilot

Staging deploy, UAT, autofill tuning. Est. 4–6 weeks.

3

Phase 3 — Production

HTTPS, object storage, monitoring, credential rotation.

4

Phase 4 — Scale

Rate limits, message queues, CDN, load testing.

M7 — Pilot Deployment

Phase 2 milestone. Staging live, UAT complete, metrics validated.

M8 — Production Go-Live

Phase 3 milestone. HTTPS production URL, monitoring, sign-off.

M9 — Scale & Hardening

Phase 4 milestone. Load tested, queue workers, CDN ready.

Success Criteria

Pilot Success Metrics

Measurable targets for UAT sign-off and Phase 2 pilot completion.

Metric	Target	How Measured
Public form voice completion	Record → fill → submit works	UAT walkthrough script
Tamil transcription quality	Intelligible for clear speech	Manual QA + STT confidence
Rules autofill accuracy	≥ 80% labeled fields correct	test_form_auto_fill.py suite
Admin cost visibility	KPIs load without error	Voice Reports page + API
Auth & RBAC	Non-admin blocked from admin	API 403 + UI route guard
Pilot deployment	Staging live on Azure	M7 milestone sign-off

Before VoiceForm

8–12 min per form on mobile typing. ~40% abandonment on long Tamil forms.

With VoiceForm

2–3 min per form by voice. User only corrects mismatches — submits faster.

Phase 2 Goal

≥ 90% field accuracy on your domain forms after autofill tuning.

Next Steps

Ready to Move Forward?

Three actions to begin your VoiceForm AI pilot.

1

Schedule Pilot UAT

Book a walkthrough session. We demo live voice form filling on your target use cases.
2

Provision Staging Environment

Deploy to Azure App Service (~₹9,720/month production tier, or ~₹2,370 ultra-budget POC).
3

Define Target Forms

Share your form templates (student, employee, survey, etc.) for autofill tuning in Phase 2.

VoiceForm AI

Presentation Agenda

One Platform, Four Capabilities

What It Does

Who It Serves

The Problem We Solve

Slow Manual Data Entry

Weak Tamil Support

Opaque AI Costs

Fragmented Tooling

Industry Use Cases

Student Admission Form

Patient Intake Form

Citizen Service Request

Employee Onboarding

Tamil Voice → Auto-Filled Form

What We Built

Voice Capture

Form Builder

Public Voice Forms

Speech-to-Text

Smart Autofill

Admin Voice Reports

Drag-and-Drop Form Builder

Admin Workflow

Who Uses VoiceForm AI

Field Agent / Citizen

Knowledge Worker

Form Administrator

Operations Lead

Business Benefits

Faster Data Capture

Higher Completion Rates

Cost Transparency

Improvement Loop

Flexible Deployment

Single Integrated Stack

How It Works

Enterprise Technology Stack

System Architecture

Presentation — React SPA

Application — FastAPI Backend

AI & Voice Services

Data — PostgreSQL 16

Security & Governance

JWT Authentication

Role-Based Access

Audit Logging

Phase 2: Rate Limiting

Phase 2: MFA for Admin

Phase 2: WAF / DDoS

Admin Voice Reports

Voice Reports Dashboard

Phase 2: Pilot & AI Upgrades

Pilot Deployment

AI Enhancements

Platform & Production Readiness

Cloud Blob Storage

Rate Limiting

CI/CD Pipeline

PDF / Excel Reports

Mobile PWA

Webhook Integrations

Implementation Timeline

Phase 1 — MVP

Phase 2 — Pilot

Phase 3 — Production

Phase 4 — Scale

M7 — Pilot Deployment

M8 — Production Go-Live

M9 — Scale & Hardening

Pilot Success Metrics

Before VoiceForm

With VoiceForm

Phase 2 Goal

Ready to Move Forward?

Schedule Pilot UAT

Provision Staging Environment

Define Target Forms