DEV Community

Cover image for LingoLens ๐ŸŽค: Speak, Transcribe, Translate
Arion Dev
Arion Dev

Posted on

LingoLens ๐ŸŽค: Speak, Transcribe, Translate

This is a submission for the AssemblyAI Challenge: Sophisticated Speech-to-Text && Really Rad Real-Time && No More Monkey Business.


What I Built

LingoLens ๐ŸŽ™๏ธ is a web application designed to revolutionize how we process and analyze audio content.

Powered by AssemblyAI's LEMUR API and Gemini for enhanced AI processing, it allows users to transcribe, translate, and gain insights from audio files in real-time.

With an interactive, animated UI built using Next.js, TailwindCSS, and Framer Motion, LingoLens makes audio data accessible and actionable across different languages and contexts.


Key Features:

  • ๐ŸŽ™๏ธ Real-Time Speech-to-Text: Converts speech into text in multiple languages.
  • ๐ŸŒ Language Translation: Translates the transcriptions into various languages.
  • ๐Ÿ—ฃ๏ธ Speaker Diarization: Identifies and labels speakers in the conversation.
  • ๐Ÿ“Š Audio Analytics: Provides sentiment analysis, keyword extraction, and summarized insights.
  • ๐Ÿ’ซ Interactive UI: Beautiful animations and smooth user transitions powered by Framer Motion.
  • ๐Ÿ“‚ Export Options: Download or share transcriptions and analytics.

Demo

๐ŸŽฅ YouTube Demo: โ†˜๏ธ

๐Ÿฆ„ Live Demo: LingoLens Live in Vercel

๐Ÿ˜ป GitHub Repository: LingoLens Codebase


Screenshots ๐Ÿ“ธ

Hereโ€™s a glimpse of the app in action:

  1. ๐Ÿ  Home Page

    Home Page Screenshot

  2. ๐Ÿ“‚ Upload Audio Page

    Upload Audio Screenshot

  3. ๐Ÿ“œ Results Page

    Results Page

โœ… Select language to translate
Select Language

โœ… Translating to Chinese
Translating to Chinese

โœ… Translated to Chinese
Translated

โœ… Generated Blog from transcript
Blog Post

โœ… Generated Social Post (summary) from transcript
Social Post

โœ… Ask any Question related to transcript
Ask Question

โœ… Get instant answer
get Answer

โœ… Save this page in history for future reference
Save

4.๐Ÿช„History
History

5.๐Ÿ…ฐ๏ธAbout LingoLens

About LingoLens

6.๐Ÿ“žContact Me
Contact Me


Journey

๐ŸŽค Universal-2, AssemblyAIโ€™s Speech-to-Text model, is at the core of this application, enabling efficient and accurate transcriptions. Hereโ€™s how the LEMUR API powers the features:

  1. โœ๏ธ Transcription: Real-time conversion of audio to text, supporting multiple languages.
  2. ๐ŸŒ Translation: Transcriptions are sent to AssemblyAI for language translation, making the app globally accessible.
  3. ๐Ÿง‘โ€๐Ÿคโ€๐Ÿง‘ Speaker Diarization: Identifies multiple speakers within the audio and segments the transcription accordingly.
  4. ๐Ÿ“ˆ Audio Analytics: Sentiment analysis and keyword extraction from the transcriptions are powered by AssemblyAIโ€™s advanced processing.

Additional Tools and Prompts

  • โšก Qualified for the Really Rad Real-Time prompt by enabling live transcription and analysis in real-time.
  • โœจ Integrated Framer Motion to add delightful animations to the UI for better user interaction.

Team Submissions

๐Ÿ‘จโ€๐Ÿ’ป This is a solo submission by Aniruddha Adak. You can find the code on my GitHub repository.


Thanks for reading! ๐Ÿ˜Š

Top comments (1)

Collapse
 
hirusha_845a46eadc69225c2 profile image
Hirusha

Best Programming Codes to Sell
Get the best programming codes โ€” 5000+ codes to buy or download for free!
shorturl.at/lkfa8