Emily's Homepage

The Darwin Typeface Project

I'm currently working on Darwin, a free and open-source typeface for scientific and academic writing.

You can find more details about Darwin at its GitHub repository or website.

The Clowder Project

I'm also developing The Clowder Project, an online resource and reference for category theory based on Gerby.

The project is meant to become essentially a Stacks Project for category theory, though it will eventually also include some material in other areas. See the project website for more details.

Papers

Below you'll find some of the papers I've written or am currently working on:

Category Theory

Coends of Higher Arity, joint with Fosco Loregian. DOI: https://doi.org/10.1007/s10485-021-09653-x. arXiv: arXiv:2011.13881.

In this paper, we define and study a generalisation of co/ends for functors of the form $F\colon(\mathcal{C}^{\mathsf{op}})^{p}\times\mathcal{C}^{q}\to\mathcal{D}$.
Weighted Category Theory (in preparation), joint with Fosco Loregian.

In this paper, we study weighted variants of classical notions and constructions found in category theory, moving beyond the theory of weighted co/limits.

This gives rise to a number of interesting objects, such as weighted natural transformations, weighted ends, weighted Kan extensions, weighted adjunctions, and weighted monads.
Diagonal Category Theory (in preparation), joint with Fosco Loregian.

In this paper, we pursue a similar idea to the one in Weighted Category Theory, investigating analogues of classical notions and constructions in category theory obtained by replacing universality with respect to natural transformations by universality with respect to dinatural transformations.

This idea, which mimics the passage from co/limits to co/ends, gives rise to a number of interesting notions and objects, such as diagonal adjunctions, diagonal Kan extensions, and diagonal monads.

Furthermore, equipped with the theory developed in Weighted Category Theory, we also define and study “higher diagonal” analogues of natural transformations, leading us to an $\mathbb{N}$-graded composition law assembling together natural, dinatural, and higher diagonal natural transformations.

This leads us to a conceptual solution of the compositionality problem for dinatural transformations, a problem which has plagued category theorists since the introduction of dinatural transformations in the early 1970s: we show that dinatural transformations fail to compose solely because they are just the degree 1 part of a more general $\mathbb{N}$-graded composition law.

Other

FrontierMath: A Benchmark for Evaluating Advanced Mathematical Reasoning in AI, joint with Elliot Glazer, Ege Erdil, Tamay Besiroglu, Diego Chicharro, Evan Chen, Alex Gunning, Caroline Falkman Olsson, Jean-Stanislas Denain, Anson Ho, Olli Järviniemi, Matthew Barnett, Robert Sandler, Matej Vrzala, Jaime Sevilla, Qiuyu Ren, Elizabeth Pratt, Lionel Levine, Grant Barkley, Natalie Stewart, Bogdan Grechuk, Tetiana Grechuk, Shreepranav Varma Enugandla, and Mark Wildon. arXiv: arXiv:2411.04872.

From the abstract: We introduce FrontierMath, a benchmark of hundreds of original, exceptionally challenging mathematics problems crafted and vetted by expert mathematicians. The questions cover most major branches of modern mathematics -- from computationally intensive problems in number theory and real analysis to abstract questions in algebraic geometry and category theory. Solving a typical problem requires multiple hours of effort from a researcher in the relevant branch of mathematics, and for the upper end questions, multiple days. FrontierMath uses new, unpublished problems and automated verification to reliably evaluate models while minimizing risk of data contamination. Current state-of-the-art AI models solve under 2% of problems, revealing a vast gap between AI capabilities and the prowess of the mathematical community. As AI systems advance toward expert-level mathematical abilities, FrontierMath offers a rigorous testbed that quantifies their progress.
Humanity's Last Exam, joint with 731 other authors. arXiv: arXiv:2501.14249.

From the abstract: Benchmarks are important tools for tracking the rapid advancements in large language model (LLM) capabilities. However, benchmarks are not keeping pace in difficulty: LLMs now achieve over 90% accuracy on popular benchmarks like MMLU, limiting informed measurement of state-of-the-art LLM capabilities. In response, we introduce Humanity's Last Exam (HLE), a multi-modal benchmark at the frontier of human knowledge, designed to be the final closed-ended academic benchmark of its kind with broad subject coverage. HLE consists of 2,700 questions across dozens of subjects, including mathematics, humanities, and the natural sciences. HLE is developed globally by subject-matter experts and consists of multiple-choice and short-answer questions suitable for automated grading. Each question has a known solution that is unambiguous and easily verifiable, but cannot be quickly answered via internet retrieval. State-of-the-art LLMs demonstrate low accuracy and calibration on HLE, highlighting a significant gap between current LLM capabilities and the expert human frontier on closed-ended academic questions. To inform research and policymaking upon a clear understanding of model capabilities, we publicly release HLE at this https URL.

Contact

You can contact me at the following places:

Email: emily.de.oliveira.santos.tmf@gmail.com.
Discord: You can contact me using my username, "yui_emily".
TypeDrawers
GitHub
Twitter
Mathstodon
Category Theory Zulip

You can also find me on MathOverflow.