Research Scientist in LLMs for scientific data exploration
Published: 26 October 2024
Workload: 100%
Contract: Permanent position
Location: Zurich
The SIB Swiss Institute of Bioinformatics is an internationally recognized non-profit organization, dedicated to biological and biomedical data science. Its data scientists are passionate about creating knowledge and solving complex questions in many fields, from biodiversity and evolution to medicine. They provide essential databases and software platforms as well as bioinformatics expertise and services to academic, clinical, and industry groups. SIB federates the Swiss bioinformatics community of some 900 scientists, encouraging collaboration and knowledge sharing. The Institute contributes to keeping Switzerland at the forefront of innovation by fostering progress in biological research and enhancing health.
To reinforce our team in Zürich, Switzerland, we are seeking a
Research Scientist in LLMs for scientific data exploration
- To build upon state-of-the-art Large Language Models for developing innovative question answering systems over scientific data sources available as Knowledge Graphs.
- To leverage LLMs for Knowledge Graph construction (e.g., triple extraction from unstructured data).
- To contribute to the implementation of a system for generating SPARQL queries from scientific questions.
- To work on data harmonisation, create data schemas and perform data modelling using Semantic Web Technologies when appropriate.
- To build and maintain knowledge graphs based on the created or adopted data schema.
- To negotiate and coordinate with domain experts in the life sciences to reach agreement on accurate domain-specific semantic models.
- To publish and present the resulting work as peer-reviewed articles in scientific journals and at conferences.
- PhD degree in computer science or a related field, with a maximum of 5 years of relevant experience.
- Hands-on experience using and fine-tuning Large Language Models.
- Experience applying machine learning and LLMs to generate structured queries (e.g., SPARQL) would be a significant plus.
- Expertise in Semantic Web technologies (e.g., RDF, RDFS, OWL, SPARQL, SHACL).
- Familiarity with ontology engineering best practices and natural language processing would be a plus.
- Software development skills (e.g., Python, Rust, Java, version control with Git).
- Experience with life science datasets (including biomedical data) is a plus.
- Proven ability to carry out independent research and software development.
- Excellent oral and written communication skills.
- Track record of publications in top-tier conferences and journals.
- Openness to working in a highly interdisciplinary and dynamic environment.
- Ability to multitask and openness to working on multiple projects with similar responsibilities.
- Proficiency in English is required. Speaking German is a plus but not mandatory.