Hello, my name is

Harshal Chaudhari

I'm a Senior Applied Scientist at Etsy working on delivering personalized recommendations. Prior to this job, I was a PhD candidate in Computer Science at Boston University, co-advised by Prof. Evimaria Terzi and Prof. John Byers. My research interests include problems in algorithmic data mining, operations research and their applications in urban transportation systems.

Research

  • Explainable A.I. Systems
  • Reinforcement Learning
  • Robust Optimization
  • Causal Inference

Programming

  • Python, Java, C++
  • Stata, R
  • Apache Spark, Hadoop
  • SQL, MongoDB

Courses

  • Statistical ML
  • Advanced Algorithms
  • Optimization Theory
  • Operations Research

Latest News

Oct 2023: Our work, "Towards Flexibility and Robustness of LSM Trees" has been accepted for publication by the VLDB Journal!

May 2022: Our work, "Endure: A Robust Tuning Paradigm for LSM Trees Under Workload Uncertainty" has been accepted for publication at VLDB 2022!

Aug 2021: Started new job as a Senior Applied Scientist at Etsy, Inc. on the recommendations ranking team.

Oct 2020: Our work "Learn to Earn: Enabling Coordination Within a Ride-Hailing Fleet" has been accepted at IEEE BigData, 2020.

Aug 2020: My work with Zillow Group on Fairness in Multistakeholder Recommendations has been accepted at FATREC workshop held alongside RecSys 2020.

Aug 2019: I will be visiting Anchorage, Alaska for KDD 2019. Hope to see you there!

Oct 2018: After spending a very worthwhile summer interning at Zillow, we have decided to continue our research together remotely.

May 2018: Excited to spend my summer at Zillow Group as an AI intern on the Personalization team.

Feb 2018: Our paper "Putting Data in the Driver's Seat: Optimizing Earnings for On-Demand Ride-Hailing" featured on The Morning Paper.

Jan 2018: Our paper "Markov Chain Monitoring" accepted at SIAM International Conference on Data Mining (SDM18), San Diego.

Dec 2018: Our paper "Impact of free app promotion on future sales" accepted at TSMO 2018: Workshop on Two-sided Marketplace Optimization

Oct 2017: Our paper "Putting Data in the Driver's Seat: Optimizing Earnings for On-Demand Ride-Hailing" accepted at WSDM 2018, Los Angeles.

Research projects

Towards Flexibility and Robustness of LSM Trees
  • Authors:
  • Andy Huynh,
  • Harshal A. Chaudhari,
  • Evimaria Terzi,
  • Manos Athanassoulis

This is an extended journal version of our previous work on developing Endure, a new paradigm for tuning LSM trees in the presence of workload uncertainty. In this work, we explore whether an expanded and more flexible design space called K-LSM can offer similar benefits as robust formulation of the throughput maximization problem that we used while developing Endure. We find that robust optimization is the only approach that consistently high performance of database systems in presence of uncertainty.

A General Framework for Fairness in Multistakeholder Recommendations
  • Authors:
  • Harshal A. Chaudhari,
  • Sangdi Lin,
  • Ondrej Linda

Traditionally, multistakeholder recommendations problems have been formulated as integer linear programs which compute recommendations in an offline fashion, by incorporating provider constraints. Such approaches can lead to unforeseen biases wherein certain users consistently receive low utility recommendations in order to meet the global coverage constraints. To remedy this situation, we propose a general formulation that incorporates provider coverage objectives alongside individual user objectives, in a real-time personalized recommender system.

Endure: A Robust Tuning Paradigm for LSM Trees Under Workload Uncertainty
  • Authors:
  • Andy Huynh,
  • Harshal A. Chaudhari,
  • Evimaria Terzi,
  • Manos Athanassoulis

Modern LSM-tree backed key-value stores co-tune merge policies, buffer sizes and the false positive rates for the Bloom filters across different levels of LSM-tree. These systems typically minimize the costs for fixed workloads. We augment them to make them robust to perturbations in workloads.

Learn to Earn: Enabling Coordination Within a Ride-Hailing Fleet
  • Authors:
  • Harshal A. Chaudhari,
  • John W. Byers,
  • Evimaria Terzi

We combine the interpretability of vanilla reinforcement learning with combinatorial optimization techniques to propose a systematically tunable, scalable and effective framework to maximize earnings of a fleet of ride-share drivers.

Markov Chain Monitoring
  • Authors:
  • Harshal A. Chaudhari,
  • Michael Mathioudakis,
  • Evimaria Terzi

Given an initial distribution of items over the nodes of a Markov chain, we wish to estimate the distribution of items at subsequent times. In deriving these estimates, we issue queries to retrieve partial information on the distribution of items.

Putting Data in the Driver's Seat
  • Authors:
  • Harshal A. Chaudhari,
  • John W. Byers,
  • Evimaria Terzi

We formalize the problem of devising a strategy to maximize expected earnings of ride-hailing service driver, describe a series of algorithms to solve the problem, and exemplify the methods on a large scale simulation of driving for Uber in NYC ... Read more.

Impacts of free app promotion
  • Authors:
  • Harshal A. Chaudhari,
  • John W. Byers

Amazon's Free App of the Day program, aimed at improving app visibility using daily free promotions, is a compelling experiment in the 'economics of free'. We investigate its longer-term consequences on the performance of apps on Amazon Appstore.

Resume

Your web browser doesn't have PDF plugin. Instead you can click here to download the PDF file.

My fullform CV is available here.

You can also find me on the following channels