Sr. Language Data Scientist, Alexa International Tech
Job ID: 2799518 | Amazon.com Services LLC
Alexa International Tech (AIT) team is looking for a passionate, talented, and inventive Sr. Language Data Scientist to help build industry-leading technology with Large Language Models (LLMs) and multimodal systems, requiring strong deep learning and generative models knowledge.
You will collaborate with fellow language data scientists, program managers, as well as stakeholders in science, engineering, and product teams to understand the role data plays in developing data sets and exemplars that meet customer needs. You will analyze and automate processes for collecting and annotating LLM inputs and outputs to assess data quality and measurement. You will apply state-of-the-art Generative AI techniques to analyze how well our data represents human language and run experiments to gauge downstream interactions. You will work collaboratively with other language data scientists and scientists to design and implement principled strategies for data optimization.
Key Job Responsibilities
- Collaborate with Scientists and Software Engineers to help design APIs and evaluate performance of LLM.
- Produce and manipulate different types of language data, analyze, and provide efficient solutions.
- Design and lead a data collection; define scope and target, provide a guideline and training, guide teams cross sites to meet the quality bar, and run evaluation of data for hand off.
- Engineer prompts to guide generative AI to produce desired outputs in context.
- Automate operations and perform data analysis using Python or other scripting languages.
- Advocate strict adherence to annotation guidelines.
- Test and deploy changes to Alexa's language understanding codebase.
- Identify and solve production issues that are impacting the Alexa customer experience.
- Collaborate with other linguists, scientists, and designers in creating optimal solutions to elevate the customer experience.
- Own the customer-facing machine learning and deterministic models for a specific domain of features.
- Use modeling tools to bootstrap and test new functionalities.
BASIC QUALIFICATIONS
- 5+ years of data querying languages (e.g. SQL), scripting languages (e.g. Python) or statistical/mathematical software (e.g. R, SAS, Matlab, etc.) experience.
- 4+ years of data scientist experience.
- Experience with statistical models e.g. multinomial logistic regression.
PREFERRED QUALIFICATIONS
- Experience as a leader and mentor on a data science team.
- Expertise in Computational Linguistics or Natural Language Processing.
Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status. For individuals with disabilities who would like to request an accommodation, please visit .
Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $143,300/year in our lowest geographic market up to $247,600/year in our highest geographic market. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience. Amazon is a total compensation company. Dependent on the position offered, equity, sign-on payments, and other forms of compensation may be provided as part of a total compensation package, in addition to a full range of medical, financial, and/or other benefits. For more information, please visit .
#J-18808-Ljbffr