DEV Community

Julia Silge profile picture

Julia Silge

I’m an international keynote speaker and real-world practitioner focused on data analysis and machine learning. I love making beautiful charts, the statistical programming language R, and Jane Austen.

Location Salt Lake City, UT Joined Joined on  Personal website https://juliasilge.com/ github website twitter website

Work

Data scientist & software engineer at RStudio PBC

Topic modeling for Spice Girls lyrics 🇬🇧👯‍♀️🎤

Topic modeling for Spice Girls lyrics 🇬🇧👯‍♀️🎤

Reactions 6 Comments
7 min read
Predicting viewership for Doctor Who episodes

Predicting viewership for Doctor Who episodes

Comments
4 min read
Predict giant pumpkin weights 🎃 with tidymodels

Predict giant pumpkin weights 🎃 with tidymodels

Reactions 6 Comments
6 min read
Spatial resampling for the #30DayMapChallenge 🗺

Spatial resampling for the #30DayMapChallenge 🗺

Reactions 1 Comments
4 min read
Multiclass predictive modeling for economics research papers 📑

Multiclass predictive modeling for economics research papers 📑

Reactions 6 Comments
6 min read
Dimensionality reduction for Billboard Top 100 songs 🎶

Dimensionality reduction for Billboard Top 100 songs 🎶

Reactions 3 Comments
7 min read
Fit and predict with tidymodels for bird baths in Australia 🇦🇺

Fit and predict with tidymodels for bird baths in Australia 🇦🇺

Comments
6 min read
Modeling human/computer interactions on Star Trek 🖖

Modeling human/computer interactions on Star Trek 🖖

Comments
6 min read
Predict housing prices 🏠 in Austin TX with xgboost

Predict housing prices 🏠 in Austin TX with xgboost

Reactions 1 Comments
10 min read
Use racing methods to tune xgboost models and predict home runs ⚾️

Use racing methods to tune xgboost models and predict home runs ⚾️

Reactions 3 Comments 1
5 min read
Tune xgboost models with early stopping to predict shelter animal status 🐱🐶

Tune xgboost models with early stopping to predict shelter animal status 🐱🐶

Reactions 1 Comments
5 min read
Predict which Scooby Doo monsters 👻 are REAL with a tuned decision tree model

Predict which Scooby Doo monsters 👻 are REAL with a tuned decision tree model

Reactions 3 Comments
5 min read
Create a custom metric for your machine learning model to predict NYC Airbnb prices

Create a custom metric for your machine learning model to predict NYC Airbnb prices

Comments
5 min read
Class imbalance and classification metrics with aircraft wildlife strikes ✈️

Class imbalance and classification metrics with aircraft wildlife strikes ✈️

Comments
7 min read
Partial dependence plots for Mario Kart 🍄 world records

Partial dependence plots for Mario Kart 🍄 world records

Reactions 1 Comments
5 min read
Predict water availability 🚰 in Sierra Leone with random forests

Predict water availability 🚰 in Sierra Leone with random forests

Reactions 5 Comments
5 min read
Estimate change in CEO departures with bootstrap resampling

Estimate change in CEO departures with bootstrap resampling

Reactions 5 Comments
4 min read
Which Netflix titles are movies and which are TV shows? 📺

Which Netflix titles are movies and which are TV shows? 📺

Reactions 7 Comments
6 min read
Use subword features to find which post offices are in Hawaii 🌺

Use subword features to find which post offices are in Hawaii 🌺

Reactions 1 Comments
8 min read
Dimensionality reduction of United Nations voting patterns 🌍

Dimensionality reduction of United Nations voting patterns 🌍

Comments
4 min read
Bootstrap confidence intervals for Super Bowl commercials 🏈

Bootstrap confidence intervals for Super Bowl commercials 🏈

Reactions 1 Comments
4 min read
Understand inequality in student debt 🎓 with linear modeling

Understand inequality in student debt 🎓 with linear modeling

Reactions 1 Comments
3 min read
Learn tidytext with my new learnr course

Learn tidytext with my new learnr course

Reactions 4 Comments
3 min read
Explore art over time in the Tate collection 🖼

Explore art over time in the Tate collection 🖼

Comments
10 min read
Code generation for tuning random forests using IKEA furniture prices 🛋

Code generation for tuning random forests using IKEA furniture prices 🛋

Reactions 7 Comments
5 min read
Tune and interpret decision trees for wind turbine capacity 🌬

Tune and interpret decision trees for wind turbine capacity 🌬

Reactions 3 Comments
7 min read
Predicting class membership for the Datasaurus Dozen 🦖

Predicting class membership for the Datasaurus Dozen 🦖

Reactions 4 Comments
7 min read
Modeling NCAA women's 🏀 tournament seeds

Modeling NCAA women's 🏀 tournament seeds

Reactions 7 Comments
8 min read
Introducing our new book, Tidy Modeling with R 📖

Introducing our new book, Tidy Modeling with R 📖

Reactions 7 Comments 1
1 min read
Handle class imbalance in modeling Himalayan climbing expeditions ⛰

Handle class imbalance in modeling Himalayan climbing expeditions ⛰

Comments
11 min read
Modeling crop yields 🌽🍚🌾 with tidy data principles

Modeling crop yields 🌽🍚🌾 with tidy data principles

Reactions 5 Comments
4 min read
Build a predictive text model for The Last Airbender

Build a predictive text model for The Last Airbender

Reactions 28 Comments
7 min read
Get started with tidymodels and the Palmer penguins 🐧

Get started with tidymodels and the Palmer penguins 🐧

Reactions 9 Comments
6 min read
Announcing our new book 📖! Supervised Machine Learning for Text Analysis in R

Announcing our new book 📖! Supervised Machine Learning for Text Analysis in R

Reactions 6 Comments
2 min read
Predicting astronaut mission duration 👩‍🚀🚀 with bootstrap aggregation

Predicting astronaut mission duration 👩‍🚀🚀 with bootstrap aggregation

Reactions 6 Comments
7 min read
The Bechdel test and the X-Mansion with bootstrap resampling 🦸‍♀️🦸‍♂️

The Bechdel test and the X-Mansion with bootstrap resampling 🦸‍♀️🦸‍♂️

Reactions 17 Comments
6 min read
Impute missing data for historical trans-Atlantic slave voyages

Impute missing data for historical trans-Atlantic slave voyages

Reactions 8 Comments 1
8 min read
PCA and UMAP with cocktail recipes 🥃🍸🍹

PCA and UMAP with cocktail recipes 🥃🍸🍹

Reactions 5 Comments
6 min read
Learn about log odds and empirical Bayes with cocktail 🍸 recipes

Learn about log odds and empirical Bayes with cocktail 🍸 recipes

Reactions 14 Comments 1
6 min read
Tune XGBoost with beach volleyball data 🏐

Tune XGBoost with beach volleyball data 🏐

Reactions 9 Comments
9 min read
Learn supervised machine learning with my free interactive course

Learn supervised machine learning with my free interactive course

Reactions 2 Comments
2 min read
Multinomial classification for volcano eruptions 🌋 with tidymodels

Multinomial classification for volcano eruptions 🌋 with tidymodels

Comments
7 min read
Building a sentiment analysis model with Animal Crossing user reviews

Building a sentiment analysis model with Animal Crossing user reviews

Reactions 10 Comments 1
8 min read
Predicting fines for GDPR violations with tidymodels

Predicting fines for GDPR violations with tidymodels

Reactions 6 Comments
8 min read
Principal component analysis and the best hip hop songs ever

Principal component analysis and the best hip hop songs ever

Reactions 8 Comments
10 min read
Bootstrap resampling with #TidyTuesday beer production data

Bootstrap resampling with #TidyTuesday beer production data

Reactions 5 Comments
5 min read
Tuning random forest hyperparameters in R with #TidyTuesday trees data

Tuning random forest hyperparameters in R with #TidyTuesday trees data

Reactions 1 Comments
7 min read
Lasso regression for IMDB ratings of The Office

Lasso regression for IMDB ratings of The Office

Reactions 7 Comments
9 min read
Practice handling dates in R with lubridate... THEATRICALLY

Practice handling dates in R with lubridate... THEATRICALLY

Reactions 6 Comments
8 min read
loading...