Research summary
The STRING database integrates direct (physical) and indirect (functional) protein-protein associations drawn from automated literature text mining, interaction-experiment databases, curated pathway sources, and computational predictions based on co-expression and conserved genomic context. Version 10 expanded coverage to more than 2,000 organisms, using scalable algorithms to transfer interaction evidence across species [4]. The 2017 release prioritized quality control and broader accessibility through a redesigned web interface [7]. Version 11 increased coverage and added tooling explicitly designed to support functional enrichment in genome-wide experimental datasets [1]. The 2021 release added customizable networks and functional characterization of user-uploaded gene/measurement sets [5], and the 2023 release extended enrichment analysis to any sequenced genome of interest [6].
A parallel strand quantifies the genetic potential of the human gut microbiome. Illumina-based metagenomic sequencing of faecal samples from 124 European individuals produced 576.7 Gb of sequence and a non-redundant catalogue of 3.3 million microbial genes, roughly 150 times the size of the human gene complement. Most of these genes are shared across the cohort, and more than 99% are bacterial [2].
A third strand supports phylogenetic visualization. Interactive Tree Of Life (iTOL) v5 rewrites the tree display engine, adds a MEME-motif dataset type, extends annotation options to non-numerical categorical values and multiple values per node, and provides tools for drawing and labeling directly on rendered trees [3].
Across all of these resources, the engineering choices prioritize web accessibility, scalable integration of heterogeneous evidence types, and direct linkage between sequence-level data and higher-order biological context.
Recent publications
- STRING v11: protein–protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets
- A method and server for predicting damaging missense mutations
- A human gut microbial gene catalogue established by metagenomic sequencing
- Interactive Tree Of Life (iTOL) v5: an online tool for phylogenetic tree display and annotation
- STRING v10: protein–protein interaction networks, integrated over the tree of life
- The STRING database in 2021: customizable protein–protein networks, and functional characterization of user-uploaded gene/measurement sets
- The STRING database in 2023: protein–protein association networks and functional enrichment analyses for any sequenced genome of interest
- Enterotypes of the human gut microbiome
- The STRING database in 2017: quality-controlled protein–protein association networks, made broadly accessible
- Initial sequencing and comparative analysis of the mouse genome
The lab page does not clearly state student acceptance status. Email the professor directly to confirm.
How to apply
Email Peer Bork 6-12 months before your application deadline. Read several recent papers and reference specific work in your message. Use our "How to email a Japanese professor" guide for a proven email structure.
For applications via the MEXT scholarship, see our MEXT 2027 complete guide and the university-specific University Recommendation track.
External profiles
- ORCID: https://orcid.org/0000-0002-2627-833X
- OpenAlex: openalex.org
Profile compiled from public sources (Researchmap, OpenAlex, The University of Tokyo faculty directory). Last refreshed 2026-05.