Principal Data Scientist - Oncology

Johnson & Johnson
Johnson & Johnson logo
Location
Spring House, PA / Cambridge, MA / San Diego, CA
Job Type
Full-time
Posted
June 5, 2026
Views
7
Salary Range
$117k - $201k USD

Job Description

Johnson & Johnson Innovative Medicine is recruiting for a Principal Data Scientist - Oncology to join our Data Science and Digital Health team (DSDH). This position will be located at one of our offices in either Spring House PA (preferred), Cambridge MA, or San Diego CA (La Jolla area). Consideration may be given for our Titusville and Raritan, NJ locations.

The Principal Data Scientist - Oncology will play a pivotal role to standardize and connect biomedical and clinical data. You will be a hands-on technical contributor with depth in semantic technologies, ontology, and graph data modeling, and strong familiarity with the life sciences domain. You will connect enterprise master data with R&D data across the entire product lifecycle so trusted, interoperable knowledge powers analytics, search, and AI across Johnson & Johnson Innovative Medicine.

Primary Responsibilities

  • Be a key contributor to the design and implementation of a scalable knowledge graph infrastructure focused on data standardization and interoperability, focusing on Oncology R&D data.
  • Apply graph-based data modeling for efficient Oncology R&D organization, integration and retrieval to ensure system flexibility and long-term maintainability.
  • Work with a larger community of Data Scientists, Clinical Scientists, and Discovery Scientists to standardize, curate and create AI-Ready datasets.
  • Curate and extend ontologies for clear mapping into established biomedical ontologies and controlled terminologies using resource description framework (RDF) standards.
  • Work with SPARQL/GraphQL/REST services; develop ingestion and curation pipelines to ingest, normalize and map concepts across data sources.
  • Extend and curate Oncology R&D-relevant ontologies (e.g., diseases, drugs, targets, pathways) and maintain synonyms, cross-references, and provenance.
  • Partner with cross-functional teams to enable NLP/RAG over graphs, features for predictive modeling and terminology services for search and study design tools.
  • Work with Data Science & Digital Health colleagues, IT and DevOps teams to deploy and manage the graph database infrastructure.
  • Draft and manage documentation, such as data dictionaries, data lineage, and data flow diagrams.

Preferred Qualifications

  • Ph.D. or Master's degree in bioengineering, computer science, IT, bioinformatics, physics, mathematics, or related fields, emphasis on semantic technologies for biomedical application.
  • 5+ years professional experience in health informatics.
  • Demonstrated experience in large-scale knowledge graphs construction, ontology development, pharmaceutical or healthcare domains integration.
  • Programming background in parser combinators, natural language processing, and linked data (RDF Triple Stores and property graphs).
  • Proficiency in semantic web technologies (e.g. SPARQL, RDF, OWL); familiarity with graph databases (Neo4j, Amazon Neptune).
  • Proven work with complex biomedical datasets (e.g. clinical, genomics, proteomics).
  • Proficiency in various data storage solutions (SQL, key-value, column, document, graph stores) and data modeling techniques.
  • Experience in CI/CD implementations, git usage, DevOps tools, and containerization technologies (Docker, Singularity).

The anticipated base pay range for this position is $117,000.00 - $201,250.00, plus eligibility for the Company's long-term incentive program.

Researching Johnson & Johnson before you apply?

See 5 open roles · Culture, benefits & locations.

View Johnson & Johnson profile

Frequently Asked Questions

Where is this job located, and what is the work-mode policy?
This position is located on-site at a Johnson & Johnson office in Spring House, PA (preferred), Cambridge, MA, or San Diego, CA (La Jolla). Titusville and Raritan, NJ locations may also be considered. No specific remote or hybrid policy is mentioned.
What qualifications and experience are preferred for this role?
Preferred candidates have a Ph.D. or Master's degree in bioengineering, computer science, bioinformatics, or a related field, and 5+ years of health informatics experience. You should have experience with knowledge graphs, semantic web technologies (SPARQL, RDF, OWL), and complex biomedical datasets.
What are the primary responsibilities of this position?
You will design and implement a scalable knowledge graph infrastructure for Oncology R&D data. Key tasks include standardizing biomedical data, curating ontologies using RDF standards, building ingestion pipelines, and collaborating with cross-functional teams to enable NLP/RAG over graphs.
What is the salary range for this role?
The anticipated base pay range for this position is $117,000.00 - $201,250.00. The role is also eligible for the Company's long-term incentive program.
Which team will I be working with?
You will join the Data Science and Digital Health (DSDH) team at Johnson & Johnson Innovative Medicine, collaborating with other Data Scientists, Clinical Scientists, Discovery Scientists, IT, and DevOps teams.

Ready to Apply?

Apply for this Position

You'll be redirected to the company's application page

Share this job:

Explore Johnson & Johnson

Research the company before you apply.

  • 5 open roles
  • Culture, benefits & locations
View company profile

Job Information

Source: manual
AI Relevance: 88/100 (Highly relevant)
Remote Type: hybrid
Experience: Principal
Allowed Locations: Worldwide
Skills & Tags:
data science knowledge graph ontology semantic technologies RDF SPARQL graph database oncology biomedical data NLP

Get Similar Jobs by Email

Weekly digest of Johnson & Johnson and similar companies. Free.

Related Jobs

Get weekly job alerts