I know you’re all waiting on the edge of your seats for an update on the cupcakes vs. muffins data science project, but unfortunately I don’t have any answers to that age-old question* yet.
As silly as it may sound, I’m actually considering using this data set for a paper about using PLS (partial least squares regression) for ecological data. So for now, I’m holding off on blogging about any analysis results in case I end up wanting to use them for the publication.
One thing I’ve learned from my PhD at Tufts is that I really enjoy working on data wrangling, visualization, and statistics in R. I enjoy it so much that lately I’ve been strongly considering a career in data science after graduation. As a way to showcase my data science skills, I’ve been working on a side project that uses web scraping and multivariate statistics to answer the age-old question: Are cupcakes really that different from muffins?