Reflections from MBL STAMPS 2024

Austin's experience and recap of the STAMPS 2024 workshop

Introduction

Hey there, I’m Austin Marshall, a first year postdoctoral fellow in the Villapol and Treangen Labs where I work on trimming the knowledge gap between microbiologists and computer scientists. This blog will be related to my experience of the Strategies and Techniques for Analyzing Microbial Population Structures (STAMPS) workshop that I attended this past July.

Venue

Location

The STAMPS workshop is held at the University of Chicago, Marine Biological Laboratory. The Marine Biological Laboratory (MBL) is located in Wood’s Hole, Massachusetts and is the closest mainland town to Martha’s Vineyard.

History

I was unaware of the immensely deep research history that the Woods Hole Oceanographic Institute and Marine Biology Laboratory have until I was tired of having my thinking cap on one evening and started looking around the amazing displays where I stumbled upon this.

Check out this awesome Nobel prize they have on display near the Lillie Library, it was awarded to Thomas Hunt Morgan for his discovery of the role chromosomes play in heredity. Countless nobel laureates, defining figures in biology, and just amazing researchers had walked through these same halls that I was fortunate enough to have the opportunity to as well. For a wild list of nobel laureates affiliated with the MBL check out this page.

Topics and Presenters

The course description of the STAMPS workshop is pretty spot-on. (nanopore sequencing pun) We went through the analysis of short-read and long-read amplicon sequencing analysis as well as metagenomic sequence analysis. Personally, I believe the Stats Day was the most useful part for me but this could be due to my background and experience level.

The speaker lineup for the STAMPS workshop is pretty insane. The lectures began with an introduction into sequencing modalities and 16S sequence analysis by Dr. Ben Callahan. Following Dr. Callahan, we were introduced into metagenomics and kmer based analyses by Dr. C. Titus Brown who is definitely a leader in this field as his group is responsible for the creation and maintenance of sourmash, khmer, and spacegraphcats.

After Dr. Brown gave the basis of metagenomic asequencing and analysis it was time for my group to step up to the plate. My co-PI Dr. Todd J. Treangen and STAMPS alumni / research scientist in the Treangen Lab Dr. Michael G. Nute gave a very in-depth look into metagenomic assembly including a hands on demonstration of reference-based vs. de-novo assembly methods and also gave some insight into multiprocessing and how we as a class only ran our assembly on one core…

Behind the scenes of the metagenomic assembly tutorial, shown above a cinnamon bun powered window size estimator.

The next day we were introduced to the fascinating research of Dr. Curtis Huttenhower. If you are in the world of microbiome research you should be familiar with this name. Dr. Huttenhower covered a broad overview of his group’s research but dove in-depth with a few of the Biobakery’s newer tools including anpan, HAllA, and MaAsLin3. This was probably one of the more intense lab sessions we had and we certainly put those Rstudio VM’s to good use.

On the third to last day we had our Stats Day, led by the wonderful and talented Dr. Amy Willis. Dr. Willis walked the class through arguably some of the most complex and misunderstood areas of microbiome sequence analysis including relative abundances, alpha and beta diversity, as well as differential abundance analyses. Her research is certainly at the top of the field and if you are performing any microbiome analysis you should check out the StatDivLab’s github for their plethora of programs that will help give you confidence in your results. I also want to shoutout her wonderful PhD students Sarah, Shirley, and Maria who are the next generation of microbiome stats wizards.

My last full day was focused on metagenomic binning, with a little added flavor of strain level analysis and phylogenetics thanks to Mike Nute’s background and PhD with Dr. Tandy Warnow. During this lecture and lab, we walked through the steps of using common binning tools (and what they are doing) as well as ran a multiple sequence alignment using parsnp2 with a gingr visualization!

One of the speakers who was unable to come because of the dang C*VID was Dr. Mike Lee. Dr. Lee was still involved in this workshop as he was kind of our tech guy behind the scenes, as he had setup all our own individual compute instances (JupyterLab and Rstudio) and installed all the dependencies for the workshop labs to run smoothly. Mike Lee is someone I had really looked forward to meeting, as his website Happy Belly Bioinformatics, was one of the main resources I used to begin my bioinformatics journey, also we worked in the same field at different NASA facilities for a while. Mike if you’re reading this hmu!

Overall experience

I think this was a great learning experience and it was hosted at a fantastic institution that has a remarkable past. Proabably my biggest gripe about this conference was the lack of air conditioning in the housing. Coming from Houston Texas, where the AC is on so much that there is a 40 degree temperature difference between inside and outside this was a bit of a challenge but it should not defer anyone from attending this workshop, it’s part of the experience.

Should you go?

Yes. If you are a biologist working on any microbiome analysis this would be a very beneficial experience for you and even if you are slightly seasoned in microbial sequence analysis like myself, I was still able to find it quite useful and gave me some more confidence to help get over the ever present imposter syndrome. If you have any questions on the STAMPS workshop or on more aspects of my time at STAMPS feel free to reach out via my contact info and check out the STAMPS 2024 github!