Software

Scroll horizontally for more information
Author(s)TitleDescriptionLanguageDocumentationCodePapers
Cécile AnéPhyloNetworksInference and manipulation of phylogenetic networks, and their use for trait evolutionJuliahttps://crsl4.github.io/PhyloNetworks.jl/dev/https://github.com/crsl4/PhyloNetworks.jlhttps://doi.org/10.1093/molbev/msx235
Cécile AnéPhyloPlotsphylogenetic network visualizationhttps://cecileane.github.io/PhyloPlots.jl/dev
Cécile AnéQuartetNetworkGoodnessFitphylogenetic networks analyses using four-taxon subsetshttps://cecileane.github.io/QuartetNetworkGoodnessFit.jl/dev/
Cécile AnéPhyloCoalSimulationssimulate phylogenies under the coalescenthttps://cecileane.github.io/PhyloCoalSimulations.jl/dev/
Vivak Patel, Daniel Adrian MaldonadoRLinearAlgebra.jlDeploy randomized linear solversJuliahttps://numlinalg.github.io/RLinearAlgebra.jl/dev/https://github.com/numlinalg/RLinearAlgebra.jl
Sameer DeshpandeflexBARTA faster and more flexible implementation of Bayesian Additive Regression TreesR, C++https://github.com/skdeshpande91/flexBARThttps://arxiv.org/abs/2211.04459
Owen Melia, Eric Jonas, Rebecca WillettRotation Invariant Random FeaturesCode for "Rotation-Invariant Random Features Provide a Strong Baseline
for Machine Learning on 3D Point Clouds"
Pythonhttps://github.com/meliao/rotation-invariant-random-features
Suzanna Parkinson, Greg Ongie, Rebecca WillettLinear layers in neural networksCode for https://arxiv.org/abs/2305.15598Pythonhttps://github.com/suzannastep/linearlayershttps://arxiv.org/abs/2305.15598
Elena Orlova, Aleksei Ustimenko, Ruoxi Jiang, Peter Y. Lu, Rebecca WillettDeep Stochastic MechanicsCode for https://arxiv.org/abs/2305.19685Pythonhttps://github.com/elena-orlova/deep-stochastic-mechanicshttps://arxiv.org/abs/2305.19685
Yuming Chen, Daniel Sanz-Alonso, Rebecca WillettReduced-Order Autodifferentiable Ensemble Kalman Filters (ROAD-EnKF)Code for https://arxiv.org/abs/2301.11961Pythonhttps://github.com/ymchen0/ROAD-EnKFhttps://arxiv.org/abs/2301.11961
Yuming Chen, Daniel Sanz-Alonso, Rebecca WillettAuto-differentiable Ensemble Kalman Filters (AD-EnKF)Code for https://epubs.siam.org/doi/abs/10.1137/21M1434477Pythonhttps://github.com/ymchen0/torchEnKFhttps://epubs.siam.org/doi/abs/10.1137/21M1434477
Elena Orlova, Haokun Liu, Raphael Rossellini, Benjamin Cash, Rebecca WillettBeyond Ensemble Averages: Leveraging Climate Model Ensembles for Subseasonal ForecastingCode for https://arxiv.org/abs/2211.15856Pythonhttps://github.com/elena-orlova/SSF-projecthttps://arxiv.org/abs/2211.15856
Ruoxi Jiang, Rebecca WillettEmbed and Emulate: Learning to estimate parameters of dynamical systems with uncertainty quantificationCode for https://arxiv.org/abs/2211.01554Pythonhttps://github.com/roxie62/Embed-and-Emulatehttps://arxiv.org/abs/2211.01554
Joseph Shenouda, Rahul Parhi, Kangwook Lee, Robert D. NowakVector-Valued Variation Spaces and Width Bounds for DNNs: Insights on Weight Decay RegularizationCode for https://arxiv.org/abs/2305.16534Pythonhttps://github.com/joeshenouda/vv-spaces-nn-widthhttps://arxiv.org/abs/2305.16534
Karan SrivastavaGenerating large isosceles-free lattice subsets with reinforcement learningCode for training models and repository maintained for researchPythonhttps://github.com/ksrivastava1/isosceles_triangles_rl
Karl RohelongpcaThis package introduces a novel formula syntax for PCA. In modern applications (where data is often in "long format"), the formula syntax helps to fluidly imagine PCA without thinking about matrices. In other words, it provides a layer of abstraction above matrices.https://karlrohe.github.io/longpca/
Karl RohegdimThis package estimates graph dimension using cross-validated eigenvalues, via the graph-splitting technique. Theoretically, the method works by computing a special type of cross-validated eigenvalue which follows a simple central limit theorem. This allows users to perform hypothesis tests on the rank of the graph.https://rohelab.github.io/gdim/https://arxiv.org/abs/2108.03336
Nicholas Henderson/Michael NewtonrvaluesA collection of functions for computing "r-values" from various kinds of user input such as MCMC output or a list of effect size estimates and associated standard errorshttps://cran.r-project.org/web/packages/rvalues/index.html
Zihao Zheng/Michael A. NewtonMixTwiceImplements large-scale hypothesis testing by variance mixing. It takes two statistics per testing unit, an estimated effect and its associated squared standard error, and fits a nonparametric, shape-constrained mixture separately on two latent parametershttps://cran.r-project.org/web/packages/MixTwice/index.html

Books

Ellenberg, Jordan, Shape: The Hidden Geometry of Information, Biology, Strategy, Democracy, and Everything Else, Penguin Books, 2022. https://www.penguinrandomhouse.com/books/612131/shape-by-jordan-ellenberg/

Wright, Stephen J., and Benjamin Recht, Optimization for Data Analysis, Cambridge University Press, 2022. https://doi.org/10.1017/9781009004282

Diakonikolas, Ilias, and Daniel M. Kane, Algorithmic High-Dimensional Robust Statistics, Cambridge University Press, 2023. https://doi.org/10.1017/9781108943161

Nan Chen, Stochastic Methods for Modeling and Predicting Complex Dynamical Systems, Springer Cham, 2024. https://doi.org/10.1007/978-3-031-22249-8

Roch, Sebastien, Modern Discrete Probability: An Essential Toolkit, Cambridge University Press, 2024. https://doi.org/10.1017/9781009305129

Educational Materials & Tools

Online textbook on “Mathematical Methods in Data Science (with Python)” by Sebastien Roch. https://mmids-textbook.github.io/

Online tutorial on “Comparative methods on reticulate phylogenies”. https://cecileane.github.io/networkPCM-workshop/

Online textbook on “Causal Inference” by Amy Cochran: https://amy-cochran.gitbook.io/causal-inference

Lecture notes for a year-long course on the “Mathematics of Data Science” by Dmitriy Drusvyatskiy: https://sites.math.washington.edu/~ddrusv/crs/Math_581_2023/MATH581.html

Lecture notes and video lectures for a course on the “Mathematical Foundations of Machine Learning” by Rebecca Willett: https://willett.psd.uchicago.edu/teaching/mathematical-foundations-of-machine-learning-fall-2021/

Autograder: a server to automatically grading coding assignments. https://github.com/edulinq/autograder-server

Quiz Generator: allows a general format for quiz banks, that then can be uploaded in a variety of forms (gradescope, Canvas, pdf, html, qti, etc.); support for latex, wide variety of question types; and allows support for collaboration by using standard tools like git for managing questions. https://github.com/edulinq/quizgen

Canvas Tools: a suite of tools and Python interface for Instructure’s Canvas LMS. https://github.com/edulinq/py-canvas