S. Cenk Sahinalp, Ph.D.

Senior Investigator

My research focuses on developing algorithmic methods for managing, storing, communicating and analyzing high-throughput sequencing data, especially in the context of cancer.

Areas of Expertise

1) computational biology, 2) algorithms and combinatorial optimization, 3) genomics,
4) biomolecular networks, 5) transcriptomics, 6) intra-tumor heterogeneity and tumor evolution

Contact Info

S. Cenk Sahinalp, Ph.D.
Center for Cancer Research
National Cancer Institute
Building 10, Room 6N119
Bethesda, MD 20892
Ph: 240-858-3169
cenksahinalp@gmail.com

A key focus of my lab is the discovery and interpretation of large-scale (especially structural) genomic and transcriptomic variants in tumor samples. Our algorithmic methods for genomic structural variation discovery, including VariationHunter, CommonLAW, DeStruct and NovelSeq, were the first with the ability to handle novel insertions, deletions, inversions and duplications in repeat regions of the human genome. More recently, I have been interested in applying the algorithmic technology we developed for structural variant discovery to the exact genotyping of high copy number, structurally variant genes, e.g., those involved in drug metabolism, for which my group has developed the Cypiripi and Aldy methods. My group has also contributed to the identification and quantification of transcriptomic aberrations, in particular gene fusions, as well as genic inversions, duplications and deletions in cancer samples. Leading computational methods that we developed include DeFuse, NFuse, Comrad, MiStrVar and SVICT, the last of which can handle circulating cell-free tumor DNA data. My recent interests include modeling tumor evolution and heterogeneity through both bulk and single-cell sequencing (CITUP, CTP-Single, Remix-T and BSCITE) and network-aided, integrative analysis of genomic and transcriptomic sequence data from tumor samples (Hit’nDrive and cdCAP). Finally, I have an ongoing interest in what I would like to call “algorithmic infrastructure” for genomics, including (i) read mapping, especially of reads from repetitive regions of the genome or reads with high error rates (examples include mrFAST, mrsFAST, drFAST and lordFAST), (ii) genomic data compression (SCALCE, DeeZ and AssemblTrie) and (iii) secure/privacy-preserving computing (PrivStrat and SkSES).

NIH Scientific Focus Areas:
Computational Biology, Genetics and Genomics, Systems Biology
  1. Hach F, Hormozdiari F, Alkan C, Birol I, Eichler EE, Sahinalp SC.
    Nature Methods. 7(8): 576-7, 2010. [ Journal Article ]
  2. McPherson A, Wu C, Wyatt AW, Shah S, Collins C, Sahinalp SC.
    Genome Research. 22(11): 2250-61, 2012. [ Journal Article ]
  3. Shrestha R, Hodzic E, Sauerwald T, Dao P, Wang K, Yeung J, Anderson S, Vandin F, Haffari G, Collins CC, Sahinalp SC.
    Genome Research. 27(9): 1573-88, 2017. [ Journal Article ]
  4. Numanagić I, Malikić S, Ford M, Qin X, Toji L, Radovich M, Skaar TC, Pratt VM, Berger B, Scherer S, Sahinalp SC.
    Nature Communications. 9(1): 828, 2018. [ Journal Article ]
  5. Numanagić I, Bonfield JK, Hach F, Voges J, Ostermann J, Alberti C, Mattavelli M, Sahinalp SC.
    Nature Methods. 13(12): 1005-8, 2016. [ Journal Article ]

S. Cenk Sahinalp received his B.Sc. in Electrical Engineering from Bilkent University, Ankara, Turkey, and his Ph.D. in Computer Science from the University of Maryland, College Park. His Ph.D. thesis was on parallel and serial algorithms for string/sequence processing. After a brief postdoctoral fellowship at Bell Labs, Murray Hill, he held faculty positions at the University of Warwick, UK, Case Western Reserve University, Simon Fraser University (where he was a Canada Research Chair in Computational Genomics while being affiliated with the Vancouver Prostate Centre) and, most recently, Indiana University, Bloomington, all in Computer Science. Over his career, his research attention shifted from combinatorial pattern matching algorithms and data structures to their applications in genome sequence analysis, especially in the context of cancer. Nevertheless, much of his recent work in computational genomics uses combinatorial optimization techniques.

In the past two decades, Sahinalp has directed or participated in a number of large-scale research projects funded by U.S. and Canadian sources on the use of high-throughput sequencing data for better characterization of the structure, evolution and heterogeneity of cancer genomes. He has (co)trained more than two dozen Ph.D. students and postdocs, many of whom now hold independent academic and research positions in the U.S., Canada and elsewhere. He is also actively engaged in the computational biology community: he organized RECOMB 2011 in Vancouver, BC, chaired the program committee of RECOMB 2017 in Hong Kong, founded RECOMB-Seq, and currently serves on the steering committee of RECOMB.

Name Position
Kaiyuan Zhu Predoctoral Visiting Fellow (Graduate Student)
Mohammad Haghir Ebrahimabadi Predoctoral Visiting Fellow (Graduate Student)
Can Kockan Predoctoral Visiting Fellow (Graduate Student)
Cindy Li Predoctoral Visiting Fellow (Graduate Student)
Farid Mehrabadi Predoctoral Visiting Fellow (Graduate Student)