RECOMB 2022 Proceedings

San Diego, CA, USA, May 22–25, 2022.

PC Chair: Itsik Pe’er, Columbia University
Organization Committee: Vineet Bafna (chair), Vikas Bansal, Jocelyn Bernardo, Melissa Gymrek, Siavash Mirarab, Glenn Tesler, Kaiyuan Zhu
Keynote Speakers: Regina Barzilay, Howard Y. Chang, John Chodera, Lenore Cowen, John Marioni, Bing Ren, Wenyi Wang

List of Publications

  • Unsupervised integration of single-cell multi-omics datasets with disparities in cell-type representation. Pinar Demetci, Rebecca Santorella, Bjorn Sandstede, Ritambhara Singh.
    • Proceedings: Research in Computational Molecular Biology. RECOMB 2022. Lecture Notes in Computer Science, vol 13278, pp 3-19, Springer, Cham.
    • Preprint: bioRxiv 2021.11.09.467903
    • Journal: SCOTv2: Single-Cell Multiomic Alignment with Disproportionate Cell-Type Representation. Pinar Demetci, Rebecca Santorella, Manav Chakravarthy, Bjorn Sandstede, Ritambhara Singh. Journal of Computational Biology, 29(11): 1213-1228, 2022.
  • Semi-supervised single-cell cross-modality translation using Polarbear. Ran Zhang, William Noble, Laetitia Meng-Papaxanthos, Jean-Philippe Vert.
    • Proceedings: Research in Computational Molecular Biology. RECOMB 2022. Lecture Notes in Computer Science, vol 13278, pp 20-35, Springer, Cham.
    • Preprint: bioRxiv 2021.11.18.467517
    • Journal: Multimodal Single-Cell Translation and Alignment with Semi-Supervised Learning. Ran Zhang, Laetitia Meng-papaxanthos, Jean-philippe Vert, William Stafford Noble. Journal of Computational Biology, 29(11): 1198-1212, 2022.
  • Transcription Factor-Centric Approach to Identify Non-Recurring Putative Regulatory Drivers in Cancer. Jingkang Zhao, Vincentius Martin, Raluca Gordan.
  • DeepMinimizer: A Differentiable Framework for Optimizing Sequence-Specific Minimizer Schemes. Quang Minh Hoang, Hongyu Zheng, Carl Kingsford.
    • Proceedings: Research in Computational Molecular Biology. RECOMB 2022. Lecture Notes in Computer Science, vol 13278, pp 52-69, Springer, Cham.
    • Preprint: bioRxiv 2022.02.17.480870
    • Journal: Differentiable Learning of Sequence-Specific Minimizer Schemes with DeepMinimizer. Quang Minh Hoang, Hongyu Zheng, Carl Kingsford. Journal of Computational Biology, 29(12): 1288-1304, 2022.

-MetaCoAG: Binning Metagenomic Contigs via Composition, Coverage and Assembly Graphs. Vijini Mallawaarachchi, Yu Lin.

  • Proceedings: Research in Computational Molecular Biology. RECOMB 2022. Lecture Notes in Computer Science, vol 13278, pp 70-85, Springer, Cham.
  • Preprint: bioRxiv 2021.09.10.459728
  • Journal:Accurate Binning of Metagenomic Contigs Using Composition, Coverage, and Assembly Graphs. Vijini Mallawaarachchi, Yu Lin. Journal of Computational Biology, 29(12): 1357-1376, 2022.
  • A Fast, Provably Accurate Approximation Algorithm for Sparse Principal Component Analysis Reveals Human Genetic Variation Across the World. Agniva Chowdhury, Aritra Bose, Samson Zhou, David P Woodruff, Petros Drineas.
    • Proceedings: Research in Computational Molecular Biology. RECOMB 2022. Lecture Notes in Computer Science, vol 13278, pp 86-106, Springer, Cham.
  • Gene Set Priorization Guided by Regulatory Networks with p-values through Kernel Mixed Model. Haohan Wang, Oscar Lopez, Wei Wu, Eric Xing.
    • Proceedings: Research in Computational Molecular Biology. RECOMB 2022. Lecture Notes in Computer Science, vol 13278, pp 107-125, Springer, Cham.
    • Journal: Kernel Mixed Model for Transcriptome Association Study. Haohan Wang, Oscar Lopez, Eric P. Xing, Wei Wu. Journal of Computational Biology, 29(12): 1353-1356, 2022.
  • Real-valued Group Testing for Quantitative Molecular Assays. Seyran Saeedi, Myrna Serrano, Dennis G. Yang, J. Paul Brooks, Gregory A. Buck, Tom Arodz.
    • Proceedings: Research in Computational Molecular Biology. RECOMB 2022. Lecture Notes in Computer Science, vol 13278, pp 126-142, Springer, Cham.
    • Journal: Group Testing Matrix Design for PCR Screening with Real-Valued Measurements. Seyran Saeedi, Myrna Serrano, Dennis G. Yang, J. Paul Brooks, Gregory A. Buck, Tomasz Arodz. Journal of Computational Biology, 29(12): 1397-1411, 2022.
  • On the Effect of Intralocus Recombination on Triplet-Based Species Tree Estimation. Max Hill, Sebastien Roch.
    • Proceedings: Research in Computational Molecular Biology. RECOMB 2022. Lecture Notes in Computer Science, vol 13278, pp 143-158, Springer, Cham.
    • Preprint: bioRxiv 2021.11.06.467557
    • Journal: Inconsistency of Triplet-Based and Quartet-Based Species Tree Estimation under Intralocus Recombination 8. Max Hill, Sebastien Roch. Journal of Computational Biology, 29(11): 1173-1197, 2022.
  • QT-GILD: Quartet Based Gene Tree Imputation Using Deep Learning Improves Phylogenomic Analyses Despite Missing Data. Sazan Mahbub, Shashata Sawmya, Arpita Saha, Rezwana Reaz, M Sohel Rahman, Md Shamsuzzoha Bayzid.
    • Proceedings: Research in Computational Molecular Biology. RECOMB 2022. Lecture Notes in Computer Science, vol 13278, pp 159-176, Springer, Cham.
    • Preprint: bioRxiv 2021.11.03.467204
    • Journal: Quartet Based Gene Tree Imputation Using Deep Learning Improves Phylogenomic Analyses Despite Missing Data. Sazan Mahbub, Shashata Sawmya, Arpita Saha, Rezwana Reaz, M Sohel Rahman, Md Shamsuzzoha Bayzid. Journal of Computational Biology, 29(11): 1156-1172, 2022.
  • Safe and Complete Flow Decompositions for RNA Assembly. Shahbaz Khan, Milla Kortelainen, Manuel Caceres, Lucia Williams, Alexandru I Tomescu.
    • Proceedings: Research in Computational Molecular Biology. RECOMB 2022. Lecture Notes in Computer Science, vol 13278, pp 177-192, Springer, Cham.
    • Preprint: arXiv:2201.10372
    • Journal: Improving RNA Assembly via Safety and Completeness in Flow Decompositions. Shahbaz Khan, Milla Kortelainen, Manuel Caceres, Lucia Williams, Alexandru I Tomescu. Journal of Computational Biology, 29(12): 1270-1287, 2022.
  • NetMix2: Unifying network propagation and altered subnetworks. Uthsav Chitra, Tae Yoon Park, Ben Raphael.
    • Proceedings: Research in Computational Molecular Biology. RECOMB 2022. Lecture Notes in Computer Science, vol 13278, pp 193-208, Springer, Cham.
    • Preprint: bioRxiv 2022.01.31.478575
    • Journal: NetMix2: A Principled Network Propagation Algorithm for Identifying Altered Subnetworks. Uthsav Chitra, Tae Yoon Park, Benjamin J. Raphael. Journal of Computational Biology, 29(12): 1305-1323, 2022.
  • Multi-Modal Genotype and Phenotype Mutual Learning to Single-Modal Input for Longitudinal Outcome Prediction. Alireza Ganjdanesh, Jipeng Zhang, Wei Chen, Heng Huang.
    • Proceedings: Research in Computational Molecular Biology. RECOMB 2022. Lecture Notes in Computer Science, vol 13278, pp 209-229, Springer, Cham.
    • Journal: Multimodal Genotype and Phenotype Data Integration to Improve Partial Data-Based Longitudinal Prediction. Alireza Ganjdanesh, Jipeng Zhang, Sarah Yan, Wei Chen, Heng Huang. Journal of Computational Biology, 29(12): 1324-1345, 2022.
  • Fast, Flexible, and Exact Minimum Flow Decompositions via ILP. Fernando H C Dias, Lucia Williams, Brendan Mumey, Alexandru I Tomescu.
    • Proceedings: Research in Computational Molecular Biology. RECOMB 2022. Lecture Notes in Computer Science, vol 13278, pp 230-245, Springer, Cham.
    • Preprint: arXiv:2201.10923
    • Journal: Efficient Minimum Flow Decomposition via Integer Linear Programming. Fernando H C Dias, Lucia Williams, Brendan Mumey, Alexandru I Tomescu. Journal of Computational Biology, 29(11): 1252-1267, 2022.
  • Co-linear chaining with overlaps and gap costs. Chirag Jain, Daniel Gibney, Sharma V Thankachan.
    • Proceedings: Research in Computational Molecular Biology. RECOMB 2022. Lecture Notes in Computer Science, vol 13278, pp 246-262, Springer, Cham.
    • Preprint: bioRxiv 2021.02.03.429492
    • Journal: Algorithms for Colinear Chaining with Overlaps and Gap Costs. Chirag Jain, Daniel Gibney, Sharma V Thankachan. Journal of Computational Biology, 29(11): 1237-1251, 2022.
  • The Complexity of Approximate Pattern Matching on De Bruijn Graphs. Daniel Gibney, Sharma V Thankachan, Srinivas Aluru.
    • Proceedings: Research in Computational Molecular Biology. RECOMB 2022. Lecture Notes in Computer Science, vol 13278, pp 263-278, Springer, Cham.
    • Preprint: arXiv:2201.12454
    • Journal: On the hardness of sequence alignment on de Bruijn graphs. Daniel Gibney, Sharma V Thankachan, Srinivas Aluru. Journal of Computational Biology, 29(12): 1377-1396 , 2022.
  • ProTranslator: zero-shot protein function prediction using textual description. Hanwen Xu, Sheng Wang.
  • Single-cell multi-omic velocity infers dynamic and decoupled gene regulation. Chen Li, Maria Virgilio, Kathleen Collins, Joshua Welch.
    • Proceedings: Research in Computational Molecular Biology. RECOMB 2022. Lecture Notes in Computer Science, vol 13278, pp 297-299, Springer, Cham.
    • Preprint: bioRxiv 2021.12.13.472472
    • Journal: Multi-omic single-cell velocity models epigenome-transcriptome interactions and improves cell fate prediction. Chen Li, Maria C. Virgilio, Kathleen L. Collins, Joshua D. Welch. Nature Biotechnology, 41(3):387-398, 2023.
  • Ultrafast and Interpretable Single-cell 3D Genome Analysis with Fast-Higashi. Ruochi Zhang, Tianming Zhou, Jian Ma.
    • Proceedings: Research in Computational Molecular Biology. RECOMB 2022. Lecture Notes in Computer Science, vol 13278, pp 300-301, Springer, Cham.
    • Preprint: bioRxiv 2022.04.18.488683
    • Journal: Ultrafast and interpretable single-cell 3D genome analysis with Fast-Higashi. Ruochi Zhang, Tianming Zhou, Jian Ma. Cell Systems, 13(10): 798-807.e6, 2022.
  • DiffDomain enables identification of structurally reorganized topologically associating domains. Dunming Hua, Ming Gu, Yanyi Du, Li Qi, Xiangjun Du, Zhidong Bai, Xiaopeng Zhu, Dechao Tian.
    • Proceedings: Research in Computational Molecular Biology. RECOMB 2022. Lecture Notes in Computer Science, vol 13278, pp 302-303, Springer, Cham.
    • Preprint: bioRxiv 2022.12.05.519135
    • Journal: DiffDomain enables identification of structurally reorganized topologically associating domains. Dunming Hua, Ming Gu, Xiao Zhang, Yanyi Du, Hangcheng Xie, Li Qi, Xiangjun Du, Zhidong Bai, Xiaopeng Zhu, Dechao Tian. *Nature Communications *, 15: 502, 2024.
  • Joint inference of repeated evolutionary trajectories and patterns of clonal exclusivity or co-occurrence from tumor mutation trees. Xiang Ge Luo, Jack Kuipers, Niko Beerenwinkel.
    • Proceedings: Research in Computational Molecular Biology. RECOMB 2022. Lecture Notes in Computer Science, vol 13278, pp 304-305, Springer, Cham.
    • Preprint: bioRxiv 2021.11.04.467347
    • Journal: Joint inference of repeated evolutionary trajectories and patterns of clonal exclusivity or co-occurrence from tumor mutation trees. Xiang Ge Luo, Jack Kuipers, Niko Beerenwinkel. *Nature Communications *, 14: 3676, 2023.
  • Fast and Optimal Sequence-to-Graph Alignment Guided by Seeds. Pesho Ivanov, Benjamin Bichsel, Martin Vechev.
  • CLMB: deep contrastive learning for robust metagenomic binning. Pengfei Zhang, Zhengyuan Jiang, Yixuan Wang, Yu Li.
  • Unsupervised cell functional annotation for single-cell RNA-Seq. Dongshunyi Li, Jun Ding, Ziv Bar-Joseph.
    • Proceedings: Research in Computational Molecular Biology. RECOMB 2022. Lecture Notes in Computer Science, vol 13278, pp 349-352, Springer, Cham.
    • Preprint: bioRxiv 2021.11.20.469410
    • Journal: Unsupervised cell functional annotation for single-cell RNA-seq. Dongshunyi Li, Jun Ding, Ziv Bar-Joseph. Genome Research, 32(9): 1765-1775, 2022.
  • A novel matrix factorization model for interpreting single-cell gene expression from biologically heterogeneous data. Kun Qian, Shiwei Fu, Hongwei Li, Wei Vivian Li.
    • Proceedings: Research in Computational Molecular Biology. RECOMB 2022. Lecture Notes in Computer Science, vol 13278, pp 353-355, Springer, Cham.
    • Preprint: bioRxiv 2021.10.13.464306
    • Journal: scINSIGHT for interpreting single-cell gene expression from biologically heterogeneous data. Kun Qian, Shiwei Fu, Hongwei Li, Wei Vivian Li. Genome Biology, 23(1): 82 20, 2022.
  • Tractable and expressive generative models of genetic variation data. Meihua Dang, Anji Liu, Xinzhu Wei, Sriram Sankararaman, Guy Van den Broeck.
  • CONCERT: Genome-wide prediction of sequence elements that modulate DNA replication timing. Yang Yang, Yuchuan Wang, Yang Zhang, Jian Ma.
  • CORSID enables de novo identification of transcription regulatory sequences and genes in coronaviruses. Chuanyi Zhang, Palash Sashittal, Mohammed El-Kebir.
    • Proceedings: Research in Computational Molecular Biology. RECOMB 2022. Lecture Notes in Computer Science, vol 13278, pp 360-362, Springer, Cham.
    • Preprint: bioRxiv 2021.11.10.468129
    • Journal: Accurate Identification of Transcription Regulatory Sequences and Genes in Coronaviruses. Chuanyi Zhang, Palash Sashittal, Michael Xiang, Yichi Zhang, Ayesha Kazi, Mohammed El-Kebir. *Molecular Biology and Evolution *, 39(7): msac133, 2022.
  • Learning probabilistic protein-DNA recognition codes from DNA-binding specificities using structural mappings. Joshua Wetzel, Kaiqian Zhang, Mona Singh.
    • Proceedings: Research in Computational Molecular Biology. RECOMB 2022. Lecture Notes in Computer Science, vol 13278, pp 363-365, Springer, Cham.
    • Preprint: bioRxiv 2022.01.31.477772
    • Journal: Learning probabilistic protein-DNA recognition codes from DNA-binding specificities using structural mappings. Joshua L Wetzel, Kaiqian Zhang, Mona Singh . Genome Research, 32(9):1776-1786, 2022.
  • Uncertainty quantification using subsampling for assembly-free estimates of genomic distance and phylogenetic relationships. Eleonora Rachtman, Shahab Sarmashghi, Vineet Bafna, Siavash Mirarab.
    • Proceedings: Research in Computational Molecular Biology. RECOMB 2022. Lecture Notes in Computer Science, vol 13278, pp 366-368, Springer, Cham.
    • Journal: Uncertainty Quantification Using Subsampling for Assembly-Free Estimates of Genomic Distance and Phylogenetic Relationships. Eleonora Rachtman, Shahab Sarmashghi, Vineet Bafna, Siavash Mirarab. Cell Systems, 13(10): 817-829.e3, 2022.
  • SOPHIE: viral outbreak investigation and transmission history reconstruction in a joint phylogenetic and network theory framework. Pavel Skums, Fatemeh Mohebbi, Viachaslau Tsyvina, Pelin Burcak Icer, Yury Khudyakov.
    • Proceedings: Research in Computational Molecular Biology. RECOMB 2022. Lecture Notes in Computer Science, vol 13278, pp 369-370, Springer, Cham.
    • Preprint: bioRxiv 2022.05.05.490757
    • Journal: SOPHIE: Viral outbreak investigation and transmission history reconstruction in a joint phylogenetic and network theory framework. Pavel Skums, Fatemeh Mohebbi, Vyacheslav Tsyvina, Pelin Icer Baykal, Alina Nemira, Sumathi Ramachandran, Yury Khudyakov. Cell Systems, 13(10):844-856.e4, 2022.
  • Identifying systematic variation at the single-cell level by leveraging low-resolution population-level data. Elior Rahmani, Michael Jordan, Nir Yosef.
  • Belayer: Modeling distinct cell type clusters and continuous variation of expression in spatial transcriptomics from layered tissues. Cong Ma, Uthsav Chitra, Shirley Zhang, Benjamin Raphael.
    • Proceedings: Research in Computational Molecular Biology. RECOMB 2022. Lecture Notes in Computer Science, vol 13278, pp 372-373, Springer, Cham.
    • Preprint: bioRxiv 2022.02.05.479261
    • Journal: Belayer: Modeling discrete and continuous spatial variation in gene expression from spatially resolved transcriptomics. Cong Ma, Uthsav Chitra, Shirley Zhang, Benjamin J Raphael. Cell Systems, 13(10): 786-797.e13, 2022.
  • Lossless Indexing with Counting de Bruijn Graphs. Mikhail Karasikov, Harun Mustafa, Gunnar Ratsch, Andre Kahles.
    • Proceedings: Research in Computational Molecular Biology. RECOMB 2022. Lecture Notes in Computer Science, vol 13278, pp 374-376, Springer, Cham.
    • Preprint: bioRxiv 2021.11.09.467907
    • Journal: Lossless indexing with counting de Bruijn graphs. Mikhail Karasikov, Harun Mustafa, Gunnar Ratsch, Andre Kahles. Genome Research, 32(9):1754-1764, 2022.
  • Uncovering hidden assembly artifacts: when unitigs are not safe and bidirected graphs are not helpful. Amatur Rahman, Paul Medvedev.
    • Proceedings: Research in Computational Molecular Biology. RECOMB 2022. Lecture Notes in Computer Science, vol 13278, pp 377-379, Springer, Cham.
    • Preprint: bioRxiv 2022.01.20.477068
    • Journal: Assembler artifacts include misassembly because of unsafe unitigs and underassembly because of bidirected graphs. Amatur Rahman, Paul Medvedev. Genome Research, 32(9): 1746-1753, 2022.
  • Mapping single-cell transcriptomes to copy number evolutionary trees. Pedro F Ferreira, Jack Kuipers, Niko Beerenwinkel.
  • ImmunoTyper-SR: A Novel Computational Approach for Genotyping Immunoglobulin Heavy Chain Variable Genes using Short Read Data. Michael Ford, Ananth Hari, Oscar Rodriguez, Junyan Xu, Justin Lack, Cihan Oguz, Yu Zhang, Sarah Weber, Mary Magliocco, Jason Barnett, Sandhya Xirasagar, Smilee Samuel, Luisa Imberti, Paolo Bonfanti, Andrea Biondi, Clifton Dalgard, Stephen Chanock, Lindsey Rosen, Steven Holland, Helen Su, Luigi Notarangelo, Uzi Vishkin, Corey Watson, S Cenk Sahinalp.
    • Proceedings: Research in Computational Molecular Biology. RECOMB 2022. Lecture Notes in Computer Science, vol 13278, pp 382-384, Springer, Cham.
    • Preprint: bioRxiv 2022.01.31.478564
    • Journal: ImmunoTyper-SR: A computational approach for genotyping immunoglobulin heavy chain variable genes using short-read data. Michael K. B. Ford, Ananth Hari, Oscar Rodriguez, Junyan Xu, Justin Lack, Cihan Oguz, Yu Zhang, Sarah Weber, Mary Magliocco, Jason Barnett, Sandhya Xirasagar, Smilee Samuel, Luisa Imberti, Paolo Bonfanti, Andrea Biondi, Clifton L. Dalgard, Stephen Chanock, Lindsey Rosen, Steven Holland, Helen Su, Luigi Notarangelo, NIAID COVID Consortium; Uzi Vishkin, Corey T. Watson, S. Cenk Sahinalp. Cell Systems, 13(10): 808-816.e5, 2022.
  • AutoComplete: Deep Learning-based Phenotype Imputation for Large-scale Biomedical Data. Ulzee An, Na Cai, Andy Dahl, Sriram Sankararaman.
    • Proceedings: Research in Computational Molecular Biology. RECOMB 2022. Lecture Notes in Computer Science, vol 13278, pp 385-386, Springer, Cham.
    • Preprint: bioRxiv 2022.08.15.503991
    • Journal: Deep learning-based phenotype imputation on population-scale biobank data increases genetic discoveries. Ulzee An, Ali Pazokitoroudi, Marcus Alvarez, Lianyun Huang, Silviu Bacanu, Andrew J. Schork, Kenneth Kendler, Päivi Pajukanta, Jonathan Flint, Noah Zaitlen, Na Cai, Andy Dahl. Sriram Sankararaman. Nature Genetics, 55(12):2269-2276, 2023.
  • Resistor: an algorithm for predicting resistance mutations using Pareto optimization over multistate protein design and mutational signatures. Nathan Guerin, Teresa Kaserer, Bruce Donald.
    • Proceedings: Research in Computational Molecular Biology. RECOMB 2022. Lecture Notes in Computer Science, vol 13278, pp 387-389, Springer, Cham.
    • Preprint: bioRxiv 2022.01.18.476733
    • Journal: Resistor: An algorithm for predicting resistance mutations via Pareto optimization over multistate protein design and mutational signatures. Nathan Guerin, Andreas Feichtner, Eduard Stefan, Teresa Kaserer, Bruce Donald. Cell Systems, 3(10):830-843.e3, 2022.
  • Ultra high diversity factorizable libraries for efficient therapeutic discovery. Zheng Dai, Sachit Saksena, Geraldine Horny, Christine Banholzer, Stefan Ewert, David Gifford.
    • Proceedings: Research in Computational Molecular Biology. RECOMB 2022. Lecture Notes in Computer Science, vol 13278, pp 390-392, Springer, Cham.
    • Preprint: bioRxiv 2022.01.17.476670
    • Journal: Ultra-high-diversity factorizable libraries for efficient therapeutic discovery. Zheng Dai, Sachit D Saksena, Geraldine Horny, Christine Banholzer, Stefan Ewert, David K Gifford. Genome Research, 32(9):1787-1794, 2022.