Photo: Sylvain Crouzet

I am a Professor in the Department of Mathematics at the University of Wisconsin-Madison, where I belong to the Probability Group and Applied Mathematics Group. I am also a member of the Institute for Foundations of Data Science (IFDS), I have an affiliate appointment in the Department of Statistics and I am affiliated with the Theory of Computing Group in the Department of Computer Sciences. I can also be found at the Wisconsin Institute for Discovery. My CV is here.

I work at the intersection of applied probability, statistics and theoretical computer science, with an emphasis on biological applications. More details on my research interests and publications can be found here. My work is currently supported by NSF grants DMS-1902892, DMS-1916378, and DMS-2023239 (TRIPODS Phase II) as well as a Vilas Associates Award.


Dec 2020: The new website of IFDS has launched.

Sep 2020: The expanded Institute for Foundations of Data Science, now in a collaboration with the University of Washington, the University of California, Santa Cruz, and the University of Chicago has been funded through the NSF TRIPODS Phase II program.

Aug 2020: My Ph.D. student Brandon Legried successfully defended his thesis. He is now a postdoc in Deparment of Statistics at the University of Michigan. Congratulations!

Jun 2020: UW-Madison's new Data Science Major has officially launched. I serve on its program committee.

Oct 2019: I am on the scientific committee of the 41st Midwest Probability Colloquium, which will be held October 10-12, 2019 at Northwestern University.

Aug 2019: A new grant, joint with Karl Rohe, has been funded by the NSF.

Aug 2019: I am co-organizing a special session on Recent Trends in the Mathematics of Data at the AMS Fall Central Sectional Meeting, held in Madison on Sep 14-15, 2019.

May 2019: My new undergraduate course, MATH 535: Mathematical Methods in Data Science, has been approved by the university. It will be offered for the first time in Spring 2020. More details to come.

May 2019: A new grant, joint with Cecile Ane and Curtis Brandt, has been funded by the Joint DMS/NIGMS Initiative to Support Research at the Interface of the Biological and Mathematical Sciences

Apr 2019: Our cluster hire proposal "Foundations of Data Science" (with M. Newton, R. Nowak and S. Wright) has been selected by UW's Cluster Hiring Initiative.

Jan 2019: I received a Vilas Associates Award for 2019-2021.

Lecture notes and tutorials

Advanced undergraduate course on the mathematics of data

Graduate course on modern discrete probability

Topics course on stochastic processes in evolutionary genetics

First year of graduate probability theory

Brief survey of mathematical phylogenetics

Tutorial on sequence-length requirements in phylogenetics

Summer school slides on probability on graphs with applications to data science

Tutorial slides on mathematical phylogenomics

Selected Publications (full list here)

Polynomial-Time Statistical Estimation of Species Trees Under Gene Duplication and Loss
Proceedings of RECOMB 2020, 120-135. With Brandon Legried, Erin Molloy, and Tandy Warnow.
Generalized least squares can overcome the critical threshold in respondent-driven sampling
Proceedings of the National Academy of Sciences, 115(41):10299-10304, 2018. With Karl Rohe.
Circular Networks from Distorted Metrics
Proceedings of RECOMB 2018, 167-176. With J. Wang.
(Best Paper Award, RECOMB 2018)
Distance-based species tree estimation under the coalescent: information-theoretic trade-off between number of loci and sequence length
Ann. Appl. Probab., 27(5)-2926-2955, 2017. With E. Mossel.
Conference abstract in Proceedings of RANDOM 2015, 931-942.
Phase transition in the sample complexity of likelihood-based phylogeny inference
Probability Theory and Related Fields, 169(1), 3-62, 2017. With A. Sly.
Access the recommendation on F1000Prime
On the robustness to gene tree estimation error (or lack thereof) of coalescent-based species tree methods
Systematic Biology, 64(4):663--676, 2015. With T. Warnow.
Likelihood-based tree reconstruction on a concatenation of aligned sequence data sets can be statistically inconsistent
Theoretical Population Biology, 100:56-62, 2015. With M. Steel.
(Honorable Mention, 2018 Marcus W. Feldman Prize in Theoretical Population Biology.)
Access the recommendation on F1000Prime
Alignment-Free Phylogenetic Reconstruction: Sample Complexity via a Branching Process Analysis
Annals of Applied Probability, 23(2):693-721, 2013. With C. Daskalakis.
Conference abstract in Proceedings of RECOMB 2010, 123-137.
Submodularity of Influence in Social Networks: From Local to Global
SIAM J. Comput., 39(6):2176-2188, 2010. With E. Mossel.
Conference abstract in Proceedings of ACM STOC 2007, 128-134.
Toward Extracting All Phylogenetic Information from Matrices of Evolutionary Distances
Science, 327(5971):1376 - 1379, 2010. (Posted by permission of the AAAS for personal use, not for redistribution.)
A Short Proof that Phylogenetic Tree Reconstruction by Maximum Likelihood is Hard
IEEE/ACM Transactions on Computational Biology and Bioinformatics, 3(1):92-94, 2006.

A full list of publications is available here.

Contact Information

Office: Van Vleck 823
Phone: 608-263-3053
Fax: 608-263-8891
Department of Mathematics
University of Wisconsin-Madison
480 Lincoln Drive
Madison, WI 53706
updated: 01/05/21