Machine Learning/Data Scientist

Protein Evolution

Data Science, Software Engineering
Guilford, CT, USA
Posted 6+ months ago

Location: This is a remote opportunity, with a strong preference for candidates based in Connecticut or California.


Company Overview:

Protein Evolution is a rapidly growing biotech company that is pioneering enzyme engineering for a sustainable future. Our team of scientific minds is committed to developing groundbreaking solutions that enable the transition to a circular economy, reducing carbon emissions and creating a cleaner, healthier planet. We use artificial intelligence to design enzymes that break down plastic waste and create new, high-quality plastic materials. By combining advancements in enzyme and chemical engineering, our technology breaks down end-of-life textile and plastic waste into the building blocks that make up new textile and plastic products – helping companies, communities and governments meet their sustainability goals, while reducing their reliance on fossil fuels.

We foster a dynamic culture that encourages employees to take ownership of their projects while prioritizing collaboration, accountability, and responsibility. Our emphasis on open communication and championing diverse perspectives creates an environment that fuels innovation and inclusivity, empowering every individual to contribute their unique talents to our collective success.

Job Overview:

We are seeking an experienced machine learning professional with a passion for natural language models to join our AI and Design team. The successful candidate will have a proven track record of developing and implementing machine learning models and algorithms to solve complex problems. They will be responsible for designing, training, and maintaining machine learning models (transformers, diffusers, and others) that power our enzyme design process.

We are looking for a talented collaborator who enjoys working with a diverse range of scientists including bioengineers, biochemists, biophysicists, microbiologists, enzymologists, and chemical engineers. Besides building and training machine learning models, you’ll also have opportunities to jump in and help with whatever is needed to get the job done - working with terabytes of data, analyzing and incorporating lab-generated data into model training, helping improve our data/ML pipelines - all while helping solve the world’s plastic pollution problem.


Responsibilities:

  • Prototype and build deep learning algorithms for protein structure prediction and design
  • Pioneer cutting-edge machine learning research and engineering to aid the development of enzymes
  • Collaborate with cross-functional teams to identify and prioritize areas for AI research and development
  • Stay up-to-date with the latest advances in machine learning and artificial intelligence and apply this knowledge to improve our products and processes
  • Communicate complex AI concepts and results to non-technical stakeholders

Qualifications:

  • Ph.D. in Computer Science, Engineering, Statistics, or a related field
  • 5 years of experience in data science and/or at least 2 years of experience in machine learning and artificial intelligence
  • Experience developing natural language models and transformers
  • Strong programming skills with experience in Python libraries for data science analysis (e.g. NumPy, Pandas, Matplotlib)
  • Strong experience with deep learning frameworks (TensorFlow and/or PyTorch)
  • Experience training deep learning models and running inference on GPUs
  • Experience with command-line tools, Unix environments, shell scripting, and cloud computing (AWS, GCP, Azure)
  • Previous experience with or willingness to learn bioinformatics tools needed for day-to-day job duties
  • Scientific curiosity to learn and understand protein engineering concepts and translate them into machine learning questions
  • Excellent communication and collaboration skills
  • Strong problem-solving skills and the ability to think creatively

Nice-to-haves (but not required):

  • Previous experience with protein bioinformatics databases (PDB, UniProt, Pfam, and others) and tools (HMMER, MMseqs, BLAST, PyMOL)
  • Previous experience with ML-based protein design models (AlphaFold, ESM, RFdiffusion)
  • Experience with containers (Docker) and big data tools (PySpark)

Additional Information

We offer great perks:

  • Fully covered medical insurance plan including dental & vision - we place great value on the well-being of our team members!
  • Competitive salaried compensation - we value our employees and show it
  • Preferred locations for this opportunity are Connecticut and California. Salary range for a candidate based in CT: $113,400-153,154. Salary range for a candidate based in CA: $147,600-199,343. Actual pay will vary based on various factors, including but not limited to location, skill, and experience.
  • Highly competitive equity compensation - we want every employee to be a stakeholder
  • Pre-tax commuter benefits - we make your commute more reasonable
  • Free onsite meals + kitchen stocked with snacks
  • 401k plan - we facilitate your retirement goals
  • Flexible time off - our time-off policies allow you to recharge and come back to work ready to make an impact

Protein Evolution is an E-Verify and equal opportunity employer promoting diversity and inclusion in the workplace. We do not discriminate on the basis of race, color, religion, marital status, age, national origin, ancestry, physical or mental disability, medical conditions, veteran status, sexual orientation, gender (including gender identity and gender expression), sex (which includes pregnancy, childbirth, and breastfeeding), genetic information, taking or requesting statutorily protected leave, or any other basis protected by law. All your information will be kept confidential according to EEO guidelines.

Legal authorization to work in the United States is required. In compliance with federal law, all persons hired will be required to verify identity and eligibility to work in the United States and to complete the required employment eligibility verification form upon hire.