Sony is hiring a Speech Recognition Intern | Up to 50k/month | Remote

Sony Research India

Job Description
Sony Research India is inviting applications for the role of Speech Recognition Intern. This is a remote, paid internship opportunity for candidates who are passionate about deep learning and speech technologies. The selected intern will work closely with experienced researchers to address real-world challenges in automatic speech recognition (ASR), including robustness under noisy conditions and minimizing hallucinations in transcription.
Interns will gain hands-on experience with state-of-the-art models and datasets and will contribute to impactful projects with potential for technical publications or open-source contributions.
Key Responsibilities
Education Requirement
Technical Skills
Python, PyTorch, TensorFlow, Torchaudio, ESPnet, Hugging Face Transformers, Wav2Vec2, Whisper, RNN-T, Academic Paper Implementation, Machine Learning, Signal Processing
Preferred Skills
Prompt Tuning, Contrastive Learning, Multi-modal Architectures, Hallucination Evaluation, Synthetic Speech Generation, Audio Perturbation Techniques
Work Mode