Job Description
TheData Science and Scientific Informatics Teamat Research and Development Sciences IT (RaDS-IT) of our Company is seeking a Lead Data Scientist for Representation Learning.
Our team is a diverse collection of scientists and engineers working towards the same goal - enabling and accelerating the next generation of pharmaceutical sciences. We collaborate closely with laboratory and in silico scientists, proposing and implementing innovative solutions that enable new organizational capabilities.
An ideal candidate will have a strong background in computational biology, machine learning, and chemistry, with a focus on developing and applying advanced methods for molecular and protein design. This role will involve creating and optimizing foundation models that support the design and evaluation of novel therapeutic candidates, in close collaboration with RaDS-IT product lines supporting the corresponding functionalities under Discovery Chemistry, Discovery Biologics, and IDVAX.
Key Responsibilities:
Molecular Representation Learning:
Develop, validate, and implement state-of-the-art machine learning and deep learning algorithms for molecular representation, focusing on capturing complex chemical properties and biological activities.
Utilize various techniques, including graph neural networks and transformer architectures, to enhance molecular and protein representations.
Collaborate with cross-functional teams to contribute in the design of novel small molecules and protein constructs tailored to specific therapeutic targets.
Protein Design:
Apply computational tools and methodologies for de novo protein design and engineering such as RF-Diffusion, ProteinMPNN and AlphaFold, using AI-driven approaches to predict protein stability, function, and interaction.
Oversee the integration of structural biology data into machine learning models to improve predictive capabilities.
Foundation Models:
Lead initiatives to develop foundation models that enable scalable and efficient molecular and protein design workflows.
Conduct research on transfer learning and few-shot learning to maximize model performance on diverse datasets.
Data Management and Collaboration:
Manage and curate large-scale datasets relevant to molecular and protein design, ensuring data integrity and accessibility for team members.
Collaborate closely with experimental chemists, biologists, data scientists, and other product teams on RaDS-IT to translate computational insights into practical a
Mentorship and Leadership:
Provide mentorship to junior scientists and researchers on the team, fostering an environment of creativity and scientific rigor.
Contribute to strategic planning and project prioritization within the team and at the higher level of the organization.
Publications and Presentations:
Lead efforts in publishing research findings in peer-reviewed journals and presenting at conferences.
Stay abreast of advancements in molecular representation learning and related fields to inform ongoing research and development.
Required Skills:
Ph.D. in Computational Biology, Bioinformatics, Chemistry, Machine Learning, or a related field, with 3+ years of experience in industry (including full time job and internship/co-op)
Proven experience in molecular and/or protein design, with a strong publication record in relevant areas.
Proficient in programming languages such as Python and R, and familiarity with ML frameworks (e.g., TensorFlow, PyTorch).
Strong understanding of molecular modeling software and tools (e.g., RDKit, OpenMM, AlphaFold, RosettaFold, MPNN, RF-Diffusion).
Excellent communication skills and ability to work collaboratively in a multidisciplinary team.
Deep knowledge of statistical methods and experimental design as applied to computational biology.
Experience with large-scale datasets and big data analytics techniques.
Optional Skills:
Experience with cloud computing platforms (e.g., AWS, Google Cloud) for computational modeling and data analysis.
Familiarity with cheminformatics and bioinformatics databases and tools (e.g., ChEMBL, UniProt).
Knowledge of synthetic chemistry or organic chemistry principles.
Experience in project management and leading cross-functional research initiatives.
Understanding of regulatory requirements in drug development.
Exposure to emerging AI techniques, such as reinforcement learning or generative models.
Current Employees apply HERE
Current Contingent Workers apply HERE
US and Puerto Rico Residents Only:
Our company is committed to inclusion, ensuring that candidates can engage in a hiring process that exhibits their true capabilities. Please click here if you need an accommodation during the application or hiring process.
As an Equal Employment Opportunity Employer, we provide equal opportunities to all employees and applicants for employment and prohibit discrimination on the basis of race, color, age, religion, sex, sexual orientation, gender identity, national origin, protected veteran status, disability status, or other applicable legally protected characteristics.As a federal contractor, we comply with all affirmative action requirements for protected veterans and individuals with disabilities. For more information about personal rights under the U.S. Equal Opportunity Employment laws, visit:
We are proud to be a company that embraces the value of bringing together, talented, and committed people with diverse experiences, perspectives, skills and backgrounds. The fastest way to breakthrough innovation is when people with diverse ideas, broad experiences, backgrounds, and skills come together in an inclusive environment. We encourage our colleagues to respectfully challenge one another's thinking and approach problems collectively.
Learn more about your rights, including under California, Colorado and other US State Acts
U.S. Hybrid Work Model
Effective September 5, 2023, employees in office-based positions in the U.S. will be working a Hybrid work consisting of three total days on-site per week, Monday - Thursday, although the specific days may vary by site or organization, with Friday designated as a remote-working day, unless business critical tasks require an on-site presence.This Hybrid work model does not apply to, and daily in-person attendance is required for, field-based positions; facility-based, manufacturing-based, or research-based positions where the work to be performed is located at a Company site; positions covered by a collective-bargaining agreement (unless the agreement provides for hybrid work); or any other position for which the Company has determined the job requirements cannot be reasonably met working remotely. Please note, this Hybrid work model guidance also does not apply to roles that have been designated as "remote".
The Company is required to provide a reasonable estimate of the salary range for this job in certain states and cities within the United States. Final determinations with respect to salary will take into account a number of factors, which may include, but not be limited to the primary work location and the chosen candidate's relevant skills, experience, and education.
Expected US salary range:
$153,800.00 - $242,200.00Available benefits include bonus eligibility, long term incentive if applicable, health care and other insurance benefits (for employee and family), retirement benefits, paid holidays, vacation, and sick days. A summary of benefits is listed here.
San Francisco Residents Only:We will consider qualified applicants with arrest and conviction records for employment in compliance with the San Francisco Fair Chance Ordinance
Los Angeles Residents Only:We will consider for employment all qualified applicants, including those with criminal histories, in a manner consistent with the requirements of applicable state and local laws, including the City of Los Angeles' Fair Chance Initiative for Hiring Ordinance
Search Firm Representatives Please Read Carefully
Merck & Co., Inc., Rahway, NJ, USA, also known as Merck Sharp & Dohme LLC, Rahway, NJ, USA, does not accept unsolicited assistance from search firms for employment opportunities. All CVs / resumes submitted by search firms to any employee at our company without a valid written search agreement in place for this position will be deemed the sole property of our company. No fee will be paid in the event a candidate is hired by our company as a result of an agency referral where no pre-existing agreement is in place. Where agency agreements are in place, introductions are position specific. Please, no phone calls or emails.
Employee Status:
RegularRelocation:
DomesticVISA Sponsorship:
NoTravel Requirements:
25%Flexible Work Arrangements:
HybridShift:
Not IndicatedValid Driving License:
NoHazardous Material(s):
n/aRequired Skills:
Business, Business Intelligence (BI), Computational Biology, Computer Programming, Data Analysis, Database Design, Data Engineering, Data Modeling, Data Science, Data Visualization, Drug Development, Machine Learning, Organic Chemistry, Pharmaceutical Sciences, Project Management, Project Prioritization, Protein Modeling, R&D Management, Social Collaboration, Software Development, Stakeholder Relationship Management, Strategic Management, Strategic Planning, Structural Biology, Waterfall ModelPreferred Skills:
Job Posting End Date:
06/19/2025*A job posting is effective until 11:59:59PM on the day BEFOREthe listed job posting end date. Please ensure you apply to a job posting no later than the day BEFORE the job posting end date.
Requisition ID:R350476


