We invest in visionary teams with transformative ideas.

Join Canaan's extended family.
195
companies
902
Jobs

Senior Data Scientist

Alkymi

Alkymi

Data Science
United States · Remote
USD 151k-185k / year + Equity
Posted on May 13, 2025

At Alkymi, we’re building the end-to-end solution for managing private markets data processing. As the first technology service to offer both machine learning and safe and secure large language models inside financial document workflows, Alkymi is changing how the private markets industry accesses their data—empowering firms to service more clients, quickly pivot investment strategies, and review deals faster.

Founded in 2017 in New York City, Alkymi works with some of the world’s leading private markets and financial services firms to automate their highest-impact workflows by delivering an unparalleled product experience. We’re laser-focused on understanding our customers’ workflows from top to bottom and building easy-to-use, powerful, tools to meet their objectives. We combine cutting-edge data science, machine learning, and LLMs with best-in-class software engineering to delight our users at some of the world’s most demanding and sophisticated firms.

Core Responsibilities:

The Senior Data Scientist is responsible for developing and implementing cutting-edge Natural Language Processing (NLP) algorithms that automate the processing of business documents. The Senior Data Scientist will work closely with our product and engineering teams to ensure that our solutions are scalable, efficient, and effective. Specific duties include: (1) design NLP pipelines to meet customer requirements; (2) Use different approaches for training entity tagging models (e.g. CRF, RNN, CNN, Transformer); (3) Analyze large datasets to identify patterns and insights that drive business decisions; (4) Stay up-to-date on the latest developments in computer vision and NLP research, and incorporate new techniques into our solutions; (5) Communicate technical concepts and insights to both technical and non-technical stakeholders; (6) Build and deploy systems for automating business processes; (7) use Python and source code management, debugging, testing, and deployment to develop software; (8) use text pre-processing and normalization techniques including tokenization and POS tagging for text document processing; (9) use asynchronous programming and frameworks such as Tensorflow, Pytorch, and Natural Language Processing deep learning algorithms for text document processing; and (10) oversee the work of and mentor junior data scientists. Telecommuting available from anywhere in US. HQ at 228 Park Ave S, Ste. 63730, New York, NY 10003.

Qualifications:

This position requires a Master’s degree or the equivalent in Computer Science, Mathematics, or a related field. Must have 2 years of related experience. Must also have 12 months of experience, as demonstrated through employment or academic coursework, with each of the following:

1) Analyzing large datasets to identify patterns and data insights; 2) Build and deploy systems for automating business processes; 3) Experience with Python, CRF, RNN, CNN, Transformer; Tensorflow; Pytorch; 4) Experience with source code management, debugging, testing, and deployment; 5) Experience with text pre-processing and normalization techniques, including tokenization and POS tagging for text document processing; and 6) Experience using Natural Language Processing deep learning algorithms for text document processing. Employer will accept experience gained before, during, or after a Master’s program.

Full-time, telecommuting available from anywhere in US. HQ at 228 Park Ave S, Ste. 63730, New York, NY 10003.

Please apply online at https://www.alkymi.io/company/careers.

Salary: $151,000 to $185,000/year. Other benefits include equity, fully paid company benefits, 401k, and unlimited PTO.