my profile

About Me

(Github) (Google Scholar) (Linkedin)

My name is Chengze Shen (CV) and I am a fifth-year (started 2020) PhD student @ UIUC Computer Science program. My advisor is Tandy Warnow.

QUAL: PASSED
PRELIM: PASSED
I expect to graduate on June 30th, 2025 and plan to work as a Machine Learning Engineer at TikTok.

My Research

I am a computer scientist with strong programming skills and a broad background in bioinformatics and machine learning techniques.
I develop scalable solutions and software for challenging problems with ultra-large data in biology, sometimes using techniques in machine learning and deep learning.

My research interests broadly include Machine Learning and Bioinformatics. More specifically, I have experience in:

Talks and Presentations

  1. [June 20, 2025] Evolution 2025, software workshop. Link
  2. [November 11, 2024] PPoSS Meeting, Clusterings and Multiple Sequence Alignments.
  3. [September 4, 2023] ACM-BCB/WABI 2023, EMMA: accepted paper presentation. Link
  4. [September 1, 2022] Sandia National Lab Presentation on Scalable Alignment and Tree Estimation. Link

News

  1. [April 15, 2025] BSCAMPP has been accepted for publication at IEEE Transactions on Computational Biology and Bioinformatics!
  2. [April 5, 2025] TIPP3 is now available online at PLOS Computational Biology!
  3. [March 2, 2025] TIPP3 has been accepted for publication at PLOS Computational Biology!
  4. [Decemeber 7, 2023] EMMA has been formally published on Algorithms for Molecular Biology. DOI link
  5. [November 19, 2023] "EMMA: A New Method for Computing Multiple Sequence Alignments Given a Constraint Subset Alignment" has been accepted for publication in Algorithms for Molecular Biology!
  6. [September 3-6, 2023] Attended ACM-BCB/WABI 2023 for presentation and poster session for new multiple sequence alignment research results - EMMA. See Publications for the EMMA paper.
  7. [April 2023] Currently collaborating with Kelly P. Williams from Sandia National Lab on developing alignment methods that deal with missing clade in input data.

Teaching Assistantship

function foo(items) { var x = "Just a neat text editor"; return x; }

2025

  1. Eleanor Wedell, Chengze Shen, and Tandy Warnow. BSCAMPP: Batch-Scaled Phylogenetic Placement on Large Trees. IEEE Transactions on Computational Biology and Bioinformatics, 2025, 1-14. (Link)
  2. Chengze Shen, Eleanor Wedell, Mihai Pop, and Tandy Warnow. TIPP3 and TIPP3-Fast: Improved Abundance Profiling in Metagenomics. PLOS Computational Biology 21, no. 4 (April 4, 2025): e1012593. (Link)

2023

  1. Chengze Shen, Baqiao Liu, Kelly P. Williams, and Tandy Warnow. EMMA: A New Method for Computing Multiple Sequence Alignments Given a Constraint Subset Alignment. Algorithms for Molecular Biology 18, no. 1 (December 7, 2023): 21. (Link)
  2. Eleanor Wedell, Chengze Shen, and Tandy Warnow. BATCH-SCAMPP: Scaling phylogenetic placement methods to place many sequences. Under review at IEEE Transaction Computational Biology and Bioinformatics. An early version was published in Workshop in Algorithms for Bioinformatics 2023. (Link)
  3. Mihir Mongia*, Chengze Shen*, Arash Gholami Davoodi, Guillaume Marcais, and Hosein Mohimani. Large Scale Sequence Alignment via Efficient Inference in Generative Models. Scientific Reports 13, no. 1 (May 4, 2023): 7285. (Link)
  4. Minhyuk Park, Stefan Ivanovic, Gillian Chu, Chengze Shen, and Tandy Warnow. UPP2: Fast and Accurate Alignment of Datasets with Fragmentary Sequences.” Bioinformatics 39, no. 1 (January 1, 2023): btad007. (Link)

2022

  1. Chengze Shen, Minhyuk Park, and Tandy Warnow. WITCH: Improved Multiple Sequence Alignment Through Weighted Consensus Hidden Markov Model Alignment. Journal of Computational Biology, May 17, 2022. (Link) (pdf)
  2. Chengze Shen, Paul Zaharias, and Tandy Warnow, MAGUS+eHMMs: improved multiple sequence alignment accuracy for fragmentary sequences, Bioinformatics, Volume 38, Issue 4, 15 February 2022, Pages 918-924. (Link) (pdf)

Metagenomics

Multiple Sequence Alignment

Phylogenetic Placement

                            Bone

            I have a bone

            You have a bone              or do you?
            
            Oh, I have a lot of bones

            And you don't