Computational Linguist (French)
Text to Speech
Are you Brave, Wise, Proud and ready to Exceed? Are you passionate, driven to reach goals and objectives, have excellent attention to details and have experience in a client focused environment?
Covalen is a trusted outsourcing partner for leading global organisations. We’re a diverse team of innovators and achievers – proud of our ability to consistently exceed goals.
Our client is a social networking company that operates on a worldwide level, engaging in the development of social media applications for people to connect through mobile devices, personal computers, and other surfaces.
THE ROLE
As a Computational Linguist for Text to Speech, your role will be to work on improving TTS quality in the language of your expertise. These experts are needed to:
1. Make informed judgements of quality
2. Improve aspects of the pipeline (text normalization, pronunciation prediction)
3. Pre-emptively identify problems specific to a new language, design test sets to illustrate these problems, and potentially help design solutions to those problems.
DUTIES AND RESPONSIBILITIES
Prior to Launch
1. Create a regression test for each locale within their language
2. Create and maintain a text normalization testset for their language
3. Source and vet datasets used in training of DD TN systems, and/or craft guidelines for external annotation programs used to generate those datasets
4. Develop a set of text normalization rules for their language that guarantees certain accuracy against the testset (the TN rules are written in JavaScript)
5. Create and maintain a pronunciation golden set for G2P evaluation
6. Identify/evaluate/solve language-specific pain-points, such as grammatical gender, word stress, segmentation, tone prediction, word case / declension
7. Perform targeted data quality checks
8. Audio evaluation
After Launch
1. Fix all frontend bugs reported for the language via the methods in the Linguist Runbook, maintaining the regression test with each bug
2. Continue improving text normalization
3. Ensure each deployment of the voice passes Capability Testing referenced in the Launch Review Process
4. Perform ongoing Audio Evaluation
CANDIDATE PROFILE
Ideal candidate is a native or near-native speaker who majored/minored in linguistics, with some computational experience (or deep interest and willingness to learn).
Essential competencies needed for this role are:
1. Native or near-native (C1/C2) speaker of the market language
2. Advanced/fluent (C1/C2) level of English
3. Undergraduate degree in linguistics or similar
4. Demonstrated knowledge of International Phonetic Alphabet
5. Some command line (Linux/Ubuntu) experience
Other competencies desirable for the role are:
1. Some Python experience preferred
2. Some JavaScript experience preferred
COMPENSATION PACKAGE & BENEFITS
1. Work from Home after training (first 2 months from the office in Dublin South)
2. Performance bonus
3. Private healthcare
4. Pension contribution
5. Tax Saver and Bike-to-Work Scheme
6. Full training provided with career development opportunities
7. Be part of a great, friendly, diverse team
We can consider only applicants eligible to work full-time in Ireland. Thank you for understanding.
Keywords: Natural Language Processing, NLP, Large Language Models, LLM, Linguistics, Linux, Ubuntu, phonetics, International Phonetic Alphabet, artificial intelligence, AI
#J-18808-Ljbffr