ToneTutor Launches as a Free AI-Powered Real-Time Mandarin Speaking Grader

A new AI-powered tool is giving Mandarin learners instant feedback on their speaking skills, addressing a gap left by expensive tutors and generic conversation apps. ToneTutor, a free web-based platform, evaluates tone accuracy, grammar, and vocabulary in real time, providing an estimated HSK level within minutes.
How the Technology Works Behind the Scenes
The system chains several components to produce its assessments. On the frontend, the Web Audio API records spoken responses and exposes them as raw audio data, while a React and TypeScript interface displays transcripts in real time. The backend, built with FastAPI and hosted on Google Cloud Run, handles processing. The main engineering detail is audio format handling: browsers typically record WebM, but the pipeline sends LINEAR16 (WAV) audio to Google’s Gemini 2.5 Flash model, so the tool transcodes the recording in the browser before uploading it. The same step keeps the tool working on iOS Safari, which lacks WebM support.
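The conversion to LINEAR16 can be illustrated with a minimal sketch. ToneTutor performs this step in the browser, and its code has not been published, so the example below is written in Python purely to show the idea: clamp each floating-point sample, scale it to a signed 16-bit integer, and wrap the result in a WAV container. The function name `float_to_linear16_wav` and the 16 kHz default sample rate are assumptions made for illustration, not details from the project.

```python
import io
import struct
import wave


def float_to_linear16_wav(samples, sample_rate=16000):
    """Encode mono float samples in [-1.0, 1.0] as 16-bit PCM WAV bytes.

    Hypothetical helper: the name and the 16 kHz default are assumptions.
    """
    pcm = bytearray()
    for sample in samples:
        # Clamp out-of-range values, then scale to a signed 16-bit integer.
        clamped = max(-1.0, min(1.0, sample))
        pcm += struct.pack("<h", int(round(clamped * 32767)))

    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav:
        wav.setnchannels(1)   # mono
        wav.setsampwidth(2)   # 2 bytes per sample = 16-bit
        wav.setframerate(sample_rate)
        wav.writeframes(bytes(pcm))
    return buffer.getvalue()
```

Calling `float_to_linear16_wav([0.0, 1.0, -1.0])` returns a byte string that begins with a standard RIFF/WAVE header followed by three little-endian 16-bit samples.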
Gemini’s speech-to-text capabilities then generate a transcript, which is evaluated against HSK proficiency criteria. The grading logic, implemented in Python, uses an async function to analyze tone accuracy, grammar, and vocabulary range, returning a level estimate from HSK 1 to 6 along with specific feedback on weak points. Results are stored in Firestore, allowing users to track progress and share their HSK scorecards.
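As a rough illustration of what such an async grading function might look like, the sketch below maps three sub-scores to an HSK level and lists the weak areas. Everything in it is an assumption: the names `grade_response` and `GradeResult`, the 0.0 to 1.0 score scale, the averaging rule, and the 0.6 weakness threshold are invented for this example, and the actual model call is replaced by a placeholder. The real grading logic has not been published.

```python
import asyncio
from dataclasses import dataclass, field


@dataclass
class GradeResult:
    hsk_level: int                                   # estimate from 1 to 6
    weak_points: list[str] = field(default_factory=list)


async def grade_response(scores: dict[str, float]) -> GradeResult:
    """Map hypothetical 0.0-1.0 sub-scores to an HSK 1-6 estimate."""
    # Placeholder: a real pipeline would await the model's analysis here.
    await asyncio.sleep(0)

    average = sum(scores.values()) / len(scores)
    # Assumed rule: spread the average over six bands, capped at HSK 6.
    level = min(6, max(1, 1 + int(average * 6)))
    # Assumed threshold: anything under 0.6 is reported as a weak point.
    weak = sorted(name for name, value in scores.items() if value < 0.6)
    return GradeResult(hsk_level=level, weak_points=weak)
```

A caller would run it with something like `asyncio.run(grade_response({"tone_accuracy": 0.4, "grammar": 0.8, "vocabulary": 0.9}))` and then store the returned level and weak points alongside the transcript.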
A Practical Solution for Self-Assessment
Unlike traditional HSK speaking tests, which must be scheduled in advance, or sessions with costly tutors, ToneTutor offers a quick three-minute assessment with immediate results. The tool is free for a limited number of sessions, making it accessible to learners who want to monitor their progress without financial barriers. Its developer, a native Mandarin speaker and voice actor, designed it to fill a real need in language learning: self-assessment tools often overlook tone accuracy, a critical component of intelligibility.
While the full open-source release is still pending, the platform is already live for testing at tonetutor.tefusiang.com. For those curious about the technical pipeline or the grading logic, the developer invites questions and feedback, signaling a community-driven approach to refining AI-assisted language learning.
Source: DEV Community. AI-assisted editorial synthesis — TechnoExpress.

