I woke up this morning to an email saying my first R package, holodeck, was on it’s way to CRAN! It’s a humble package, providing a framework for quickly slapping together test data with different degrees of correlation between variables and differentiation among levels of a categorical variable.
Example use of holodeck library(holodeck) library(dplyr) df <- #make a categorical variable with 10 observations and 3 groups sim_cat(n_obs = 10, n_groups = 3, name = "Treatment") %>% #add 3 variables that covary sim_covar(n_vars = 3, var = 1, cov = 0.
This was my first time attending RStudio::conf, and I went primarily to explore my career options in data science. I mainly stuck to teaching and modeling related talks since that’s how I already use R. Here are my major takeaways from the conference.
Shiny is the new hotness Shiny apps are interactive web apps that run on R code, and there was a big focus on Shiny development at the conference this year.
I’m currently in Hangzhou, China at the Tea Research Institute(TRI) for my fourth and last time. It’s bitter sweet (like my favorite teas ;-) ) since I’m both glad to be nearing the end of my PhD, and sad to say goodbye to all the friends I’ve made and a city I’ve really grown to enjoy living in.
Fieldwork This final summer, I’ve been focusing on a few experiments having to do with leafhoppers and their effects on tea chemistry (see the project page for more info).
I know you’re all waiting on the edge of your seats for an update on the cupcakes vs. muffins data science project, but unfortunately I don’t have any answers to that age-old question* yet.
As silly as it may sound, I’m actually considering using this data set for a paper about using PLS (partial least squares regression) for ecological data. So for now, I’m holding off on blogging about any results of analyses in case I end up wanting to use them for the publication.
One thing I’ve learned from my PhD at Tufts is that I really enjoy working data wrangling, visualization, and statistics in R. I enjoy it so much, that lately I’ve been strongly considering a career in data science after graduation. As a way to showcase my data science skills, I’ve been working on a side project to use webscraping and multivariate statistics to answer the age old question: Are cupcakes really that different from muffins?
I’ve always thought I should eventually have a professional website for job hunting, but recently I’ve realized that it would be nice to have somewhere to collect my thoughts and contributions all in one place. Before having a website of my own, I generally shared updates and thoughts on Twitter and on other people’s blogs, which I will still continue to do, but having everything I do in one place seems like a good idea.