Current Research and Future Direction

I am building on this work by refining my model of spatial transcriptomic data to take into consideration the differences between assayed gene expression and activity predicted from morphological profiling via computer vision. This allows isolation of the effects from the spatial distribution of cells on the slide. This should reduce bias when finding cell interaction patterns from gene expression correlations across space. The model avoids the weaknesses of automated vision in certain aspects of understanding image composition in order to pick up patterns involving multiple cell types as well as avoiding the circular reasoning inherent in using expression for scRNA-seq cell clustering and aggregation.

This view would be useful in any organoid or solid tissue samples where the arrangement of cells on the assayed slide is expected to affect gene expression. For example, this could be used to look at immune infiltrates and muscle-tendon communication. This could also be used to look at how the epigenetically induced neuroplasticity from certain serotonergic compounds drives rewiring of maladaptive neural network connectivity patterns, providing a molecular mechanism for their observed effectiveness in addiction and mental health treatment.

My previous work looked at continuous variation of epigenetic data, such as cell type transitions, by viewing the cells as reference points in a map identifying position with gene expression looking for spatial transitions instead of the pseudotime analysis used in single cell work. This builds on that approach by looking more closely at the cell imaging data and using it to determine which cells are similar prior to the expression and position data, greatly increasing statistical power. I hope to integrate this into a spatial transcriptomics analysis computing kit to be called STACK.

Spatial transcriptomics is a great way for me to combine my RNA-seq analysis experience and abstract geometrical training because, assuming smooth expression variation over space, the biology is easily modeled using concepts from differential geometry. This allows correlation of internal cell state with morphology and local environment.

I am open to working on any other projects that could benefit from a mathematical perspective. It is worthwhile to use models that minimize assumptions and false positives at the cost of increased computing time because sequencing is still much more expensive than processor cycles. The recent advancements in sequencing technology have greatly increased the need for careful statistical treatment of NGS data since the massive number of reads produced allows for many places to search for patterns and avoid false positives. Multiomics has increased the sophistication of appropriate models significantly, requiring deeper mathematical expertise in biology.

More generally, I hope to promote the use of dynamic models in NGS analysis in order to connect the data with the biology. This allows for more information to be used, such as recognizing that transcript count is related to protein production rate. Time series data in particular can benefit from this approach. Every program is written with assumptions in mind and using the right model for the experiment at hand is key. For example, the fact that genes and mRNA are generally thought of as abstractions of proteins means that, when doing analysis about mRNA and genes, it is important to keep in mind what is happening to the final protein product. This is relevant to most epigenetic projects, since analysis of noncoding regions and epigenetic markers is usually tied to nearby genes.

As of November 2025 I have several manuscripts in progress including projects about MET amplification in lung cancer and cell differentiation in AITL. I am also working on using more differentiable and dynamical systems models in RNA-seq analysis in general. I am constantly refining my tools and analysis philosophy using my mathematical and statistical knowledge and training.

Stay tuned!

Previous Work and Education

Before working in biology at MGH and DFCI, I worked in industrial process automation writing software for electrical grid management and tax fraud detection at hmx.ai. Before that I worked in complex systems research at NECSI using ideas from physics in development economics. While there, I helped write an article featured in the CFC Annual Report 2017.

I focused on the differential geometry used to characterize continuous spacetime symmetries in mathematical physics for my undergraduate honors thesis at Washington University in St. Louis advised by Xiang Tang called: Lie Groups and Lie Algebras, for which I was awarded highest honors, inducted into Sigma Xi and chosen as an MAA student nominee member. I worked on the software for the IEEE LED Modular Dance Floor. I was also jazz director of KWUR 90.3 FM where I had a show called Rhomboid Dreamscape on which I performed live surreal dream analysis on callers and played jazz, classical and electronic music for the people of St. Louis.

While a student at Wellesley High School I worked at Northeastern University doing statistical profiling for fuel cell catalyst design in the lab of Eugene Smotkin, taught Java and C++ classes to advanced middle schoolers, and was given the Bausch and Lomb award as the top science student in my graduating class. I also threw shot for the track and field team and played bass in the band that was a national finalist in the Essentially Ellington Jazz Festival.

See my Google Scholar for a complete list of publications and metrics

About Me

I am a collaborative researcher who maintains strong relationships with co-authors in Boston and around the world. I try to take a biologically informed and statistically sound approach to computational data analysis. I enjoy learning from and teaching others. I have general quantitative training that could be useful in any scientific field.

I believe that healthcare, food, and shelter are human rights. In my free time I enjoy music, dancing, and weightlifting. I am also interested in philosophy and history. I love comedy and reading random articles on Wikipedia. I live in Cambridge, MA.

My pronouns are he/him.

Niko Kesten

Mathematical biologist using spatial data to understand how cells interact.

Research Statement

Computational Molecular Biology Work

Current Research and Future Direction

Previous Work and Education

About Me

Contact Information