Scientific Lead, Applied Intelligence for Discovery
Eli Lilly
Location
San Francisco, CA
Job Type
Full-time
Posted
March 12, 2026
Views
8
Salary Range
$167k - $266k
USD
Job Description
Lilly is developing an AI foundation to transform drug discovery research. The Applied Intelligence for Discovery (AI4D) team seeks a leader to design and deploy core AI systems including retrieval-augmented generation, text-to-SQL interfaces, and agentic workflows that connect scientists to petabyte-scale data.
Key Responsibilities
- Build and optimize RAG pipelines for internal scientific documents, lab notebooks, and study reports
- Create hybrid retrieval systems combining vector search with metadata and ontology-aware filtering
- Develop text-to-SQL systems enabling natural language queries over genomic and proteomics databases
- Design schema documentation and semantic annotations bridging scientist workflows to data storage
- Implement multi-step reasoning approaches to improve accuracy on complex queries
- Engineer agentic workflows automating database queries, bioinformatics tools, and analysis visualization
- Evaluate orchestration frameworks (LangGraph, CrewAI) for scientific applications
- Build evaluation frameworks measuring accuracy, reliability, and scientific validity
Required Qualifications
- PhD in Computer Science/Data Science with 0-3+ years experience OR MS with 5+ years experience
Preferred Qualifications
- Experience with RAG systems, text-to-SQL, agentic workflows, or fine-tuning
- Python proficiency with production-grade systems
- Familiarity with embeddings, vector databases, orchestration frameworks
- LLM evaluation framework design experience
- AWS, Docker, CI/CD familiarity
- Biomedical/pharma/biotech environment experience
- Biomedical ontologies experience (Gene Ontology, MeSH, ChEBI)
Get Similar Jobs in Your Inbox
Weekly digest of top bioinformatics jobs. No spam.
Job Information
Source:
manual
Remote Type:
onsite
Experience:
Mid-Senior
Allowed Locations:
Worldwide
Skills & Tags:
AI
LLM
RAG
Python
drug discovery
bioinformatics
text-to-SQL
agentic workflows