11 West 19th Street (22008), United States of America, New York, New York
Principal Associate, Data Scientist - US Card (Generative AI Systems)
Data is at the center of everything we do. As a startup, we disrupted the credit card industry by individually personalizing every credit card offer using statistical modeling and the relational database, cutting edge technology in 1988! Fast-forward a few years, and this little innovation and our passion for data has skyrocketed us to a Fortune 200 company and a leader in the world of data-driven decision-making.
As a Data Scientist at Capital One, you’ll be part of a team that’s leading the next wave of disruption at a whole new scale, using the latest in computing and machine learning technologies and operating across billions of customer records to unlock the big opportunities that help everyday people save money, time and agony in their financial lives.
Team Description
The Generative AI Systems (Genesis) team within Card Data Science builds state-of-art, generative AI-based solutions for dialogue, text summarization, reading comprehension, speech recognition, and image processing. We partner with product, tech, and design teams to deliver internal applications based on these solutions that drive efficiency in our business and data analytics teams, as well as customer-facing applications that enhance the customer experience. You will work with a seasoned group of Natural Language Processing (NLP), Speech, and Computer Vision specialists, experimenting with emerging technologies in generative AI, delivering software implementing these technologies, and contributing research to major NLP and AI conferences.
Role Description
In this role, you will:
- Develop applications powered by Large Language Models across multiple modalities (text, speech, and vision).
- Partner with a cross-functional team of data scientists, software engineers, and product managers to deliver a product customers love.
- Leverage a broad stack of technologies — PyTorch, Hugging Face, Spark, LangChain and more — to reveal the insights hidden within huge volumes of both structured and unstructured data.
- Build machine learning models through all phases of development, from design through training, evaluation, validation, and implementation.
- Contribute research to top-tier NLP conferences such as Association for Computational Linguistics (ACL) and Empirical Methods in Natural Language Processing (EMNLP).
- Flex your interpersonal skills to translate the complexity of your work into tangible business goals.
The Ideal Candidate is:
- Customer first. You love the process of analyzing and creating, but also share our passion to do the right thing. You know at the end of the day it’s about making the right decision for our customers.
- A leader. You challenge conventional thinking and work with stakeholders to identify and improve the status quo. You're passionate about talent development for your own team and beyond.
- Technical. You’re comfortable with open-source languages and are passionate about developing further. You have hands-on experience developing data science solutions using open-source tools and cloud computing platforms.
- Statistically-minded. You’ve built models, validated them, and backtested them. You know how to interpret a confusion matrix or a ROC curve. You have experience with clustering, classification, sentiment analysis, time series, and deep learning.
- A data guru. “Big data” doesn’t faze you. You have the skills to retrieve, combine, and analyze data from a variety of sources and structures. You know understanding the data is often the key to great data science.
Basic Qualifications:
- Currently has, or is in the process of obtaining a Bachelor’s Degree plus 5 years of experience in data analytics, or currently has, or is in the process of obtaining a Master’s Degree plus 3 years in data analytics, or currently has, or is in the process of obtaining PhD, with an expectation that required degree will be obtained on or before the scheduled start date.
- At least 1 year of experience in open source programming languages for large scale data analysis.
- At least 1 year of experience with machine learning.
- At least 1 year of experience with relational databases.
Preferred Qualifications:
- Master’s Degree in “STEM” field (Science, Technology, Engineering, or Mathematics) plus 3 years of experience in data analytics, or PhD in “STEM” or STEM-adjacent field (Science, Technology, Engineering, or Mathematics).
- At least 1 year of experience working with AWS.
- At least 3 years’ experience in Python, Scala, or R.
- At least 3 years’ experience with machine learning.
- At least 3 years’ experience with SQL.
The minimum and maximum full-time annual salaries for this role are listed below, by location. Please note that this salary information is solely for candidates hired to perform work within one of these locations, and refers to the amount Capital One is willing to pay at the time of this posting. Salaries for part-time roles will be prorated based upon the agreed upon number of hours to be regularly worked.
New York City (Hybrid On-Site): $165,100 - $188,500 for Principal Associate, Data Science.
Candidates hired to work in other locations will be subject to the pay range associated with that location, and the actual annualized salary amount offered to any candidate at the time of hire will be reflected solely in the candidate’s offer letter.
This role is also eligible to earn performance based incentive compensation, which may include cash bonus(es) and/or long term incentives (LTI). Incentives could be discretionary or non-discretionary depending on the plan.
Capital One will consider sponsoring a new qualified applicant for employment authorization for this position.
Capital One offers a comprehensive, competitive, and inclusive set of health, financial and other benefits that support your total well-being.
This role is expected to accept applications for a minimum of 5 business days. No agencies please. Capital One is an equal opportunity employer committed to diversity and inclusion in the workplace.
#J-18808-Ljbffr