Louis Oberdiear: Introducing xp_wOBA (Expected Pitch wOBA)

Louis Oberdiear

batter_2019 <- readr::read_delim(file = "C:\\Users\\louis\\Downloads\\batter_2019.csv", delim = ",")

desired_events <- c("field_out",
                    "strikeout",
                    "single",
                    "walk",
                    "double",
                    "home_run",
                    "force_out",
                    "grounded_into_double_play",
                    "hit_by_pitch",
                    "field_error",
                    "triple",
                    "fielders_choice",
                    "double_play",
                    "fielders_choice_out",
                    "strikeout_double_play")

batter_2019_events <- batter_2019 %>%
  filter(events %in% desired_events) %>%
  mutate(runner_1b = if_else(!is.na(on_1b), 1, 0),
         runner_2b = if_else(!is.na(on_2b), 1, 0),
         runner_3b = if_else(!is.na(on_3b), 1, 0)) %>%
  drop_na(release_speed) %>%
  drop_na(zone) %>%
  drop_na(pitch_type)

xgboost_fit_woba <- readRDS(file = "C:\\Users\\louis\\Documents\\GitHub\\xp_woba\\xgboost_fit_woba.rds")

set.seed(123)
be_split <- initial_split(batter_2019_events, prop = 3/4)
be_train <- training(be_split)
be_test <- testing(be_split)

woba_formula <- formula(woba_value ~ release_speed + pitch_type + zone + stand + p_throws + balls + strikes + outs_when_up + pfx_x + pfx_z + runner_1b + runner_2b + runner_3b + plate_x + plate_z)

preprocessing_recipe_woba <- 
  recipes::recipe(woba_formula, data = be_train) %>%
  recipes::step_integer(all_nominal()) %>%
  prep()

Event	wOBA value
walk	0.70
hit by pitch	0.70
field error	0.90
single	0.90
double	1.25
triple	1.60
home run	2.00
all other	0.00

.metric	.estimator	.estimate
rmse	standard	0.4985808
rsq	standard	0.1156307
mae	standard	0.3909066

player_name	game_date	.pred	release_speed	pitch_type	zone	balls	strikes
Bichette, Bo	2021-07-02	0.8719966	47.5	EP	11	3	1
Soler, Jorge	2021-06-04	0.8524219	44.6	EP	12	3	1
Bichette, Bo	2021-07-02	0.8424013	46.8	EP	11	3	0
Riley, Austin	2021-06-30	0.8070010	64.1	FA	11	1	0
Guerrero Jr., Vladimir	2021-07-02	0.8058015	49.1	EP	14	3	1
Correa, Carlos	2021-04-24	0.8051472	60.8	FA	8	2	1

player_name	game_date	.pred	release_speed	pitch_type	zone	strikes
Naquin, Tyler	2021-04-17	-0.05260795	96.4	FF	11	2
Galvis, Freddy	2021-05-09	-0.05136005	97.0	FF	11	2
Myers, Wil	2021-05-01	-0.04837190	93.4	FF	11	2
Laureano, Ramón	2021-06-22	-0.04584930	94.3	FF	11	2
Bregman, Alex	2021-06-02	-0.03250195	95.3	FF	12	2
Zunino, Mike	2021-05-12	-0.03191937	96.9	FF	12	2

player_name	game_date	.pred	release_speed	pitch_type	zone	balls	strikes
Maldonado, Martín	2021-05-20	0.1083588	93.6	FF	2	0	1
Lowe, Brandon	2021-06-05	0.1120782	92.3	FF	8	0	1
Lowe, Brandon	2021-04-11	0.1146214	98.2	FF	2	0	1
Laureano, Ramón	2021-06-22	0.1163044	93.5	FF	2	0	1
Naquin, Tyler	2021-04-17	0.1180818	95.4	FF	2	0	0
Polanco, Gregory	2021-04-18	0.1310713	94.5	FF	3	1	0

Introducing xp_wOBA (Expected Pitch wOBA)

Author

Affiliation

Published

Citation

Motivation

Why wOBA?

Methodology

Data

Modeling

Top Features

Best Pitches of the 2021 season

Highest Predicted Value

Lowest Predicted Value

Most unlikely home runs

Biggest meatball missed

xp_wOBAOE (Expected Pitch wOBA Over Expected)

Future Improvements

Future Analysis

Footnotes

Corrections

Citation

player_name	game_date	balls	strikes	.pred	release_speed	pitch_type	zone
Naquin, Tyler	2021-04-17	0	0	0.118081808	95.4	FF	2
Maldonado, Martín	2021-05-20	0	1	0.108358778	93.6	FF	2
Naquin, Tyler	2021-04-17	0	2	-0.052607950	96.4	FF	11
Polanco, Gregory	2021-04-18	1	0	0.131071314	94.5	FF	3
Turner, Trea	2021-06-08	1	1	0.155475900	93.3	FF	3
Arozarena, Randy	2021-04-19	1	2	-0.024589863	95.6	FF	11
Profar, Jurickson	2021-04-16	2	0	0.180670932	94.4	FF	3
Rojas, Miguel	2021-04-27	2	1	0.207329497	92.6	FF	3
Santana, Carlos	2021-04-11	2	2	-0.009279301	81.5	KC	14
Díaz, Yandy	2021-05-26	3	0	0.399502724	95.0	FF	1
Ohtani, Shohei	2021-05-10	3	1	0.363996923	76.3	SL	4
Walls, Taylor	2021-07-10	3	2	0.151510030	94.0	FF	7
Tucker, Kyle	2021-06-26	4	2	0.404175460	92.5	SI	13

player_name	game_date	woba_value	.pred	release_speed	pitch_type	zone	balls	strikes
Hoskins, Rhys	2021-06-22	2	0.05392908	97.9	FF	11	1	2
Altuve, Jose	2021-06-10	2	0.05897641	85.3	SL	13	0	2
Chisholm Jr., Jazz	2021-04-10	2	0.07070240	100.4	FF	12	0	2
Carlson, Dylan	2021-06-02	2	0.07594188	95.4	FF	12	2	2
Astudillo, Willians	2021-04-23	2	0.08661124	92.8	FF	12	1	2
Martinez, J.D.	2021-07-21	2	0.09053584	98.6	FF	12	2	2

player_name	game_date	.pred	release_speed	pitch_type	zone	balls	strikes
Adrianza, Ehire	2021-06-30	0.7647199	64.6	FA	2	2	1
McCormick, Chas	2021-04-24	0.7291481	60.6	FA	6	2	0
Toro, Abraham	2021-04-24	0.7173564	59.6	FA	4	0	0
Suzuki, Kurt	2021-04-16	0.7135440	59.0	FA	5	1	0
Bregman, Alex	2021-06-13	0.7102520	90.7	FF	4	2	0
Bradley, Bobby	2021-07-25	0.7083586	80.2	SL	4	2	0

player_name	n	xp_wOBAOE_sum
Ohtani, Shohei	381	41.02039
Tatis Jr., Fernando	349	34.33326
Guerrero Jr., Vladimir	406	33.23778
Castellanos, Nick	364	30.05024
Bogaerts, Xander	393	29.75805
Mullins, Cedric	418	28.39065
Devers, Rafael	406	27.89499
Martinez, J.D.	405	22.14015
Perez, Salvador	404	21.58211
Reynolds, Bryan	407	19.90814

player_name	n	xp_wOBAOE_avg
Buxton, Byron	110	0.17926097
Trout, Mike	141	0.11095308
Ohtani, Shohei	381	0.10766506
Tatis Jr., Fernando	349	0.09837609
Castellanos, Nick	364	0.08255561
Wisdom, Patrick	164	0.08236782
Guerrero Jr., Vladimir	406	0.08186644
Marte, Ketel	147	0.07708410
Bogaerts, Xander	393	0.07572022
Devers, Rafael	406	0.06870686