🚀 Exciting News: I'm co-teaching a course on 📊 AI Evals for engineers and product managers. We completed our first cohort with lots of great feedback (see testimonials) and are teaching our second and final cohort in July. Sign up now!
About Me

My dog Papaya 🐕 and me on a hike 🥾
I'm Shreya Shankar, a fifth-year PhD student at UC Berkeley in the EECS department. I am in the data systems and foundations group, advised by Dr. Aditya Parameswaran and supported by the NDSEG Fellowship. Go Bears! 🐻
As of Spring 2025, I am also a visiting student researcher in the Systems Research @ Google group. We are exploring cost optimization for LLM-powered data processing systems.
Prior to my PhD, I worked as an ML engineer in industry. I completed my BS and MS in computer science at Stanford. Go Trees! 🌲

My dog Papaya 🐕 and me on a hike 🥾
🔬 Research Interests
I am interested in building effective systems for unstructured data analysis. I lead the DocETL project, which offers a DSL, query optimizer, and execution engine for unstructured data processing (VLDB 2025). Our work introduces agentic query optimization, which uses LLMs to logically rewrite query plans to be more accurate, efficient, or both. We also offer an IDE for prototyping DocETL pipelines, called DocWrangler (UIST 2025).
I also have a deep interest in MLOps. Most recently, I co-teach a practitioner-focused course on LLM evaluation that has enrolled over 1,000 participants. I am fortunate that several of my MLOps research projects have been deployed in production at major tech companies (e.g., Meta) and startups (e.g., LangChain).
👉 Click to show/hide full bio for speaking engagements
📝 Bio (for speaking engagements, etc.)
Shreya Shankar is a PhD student in computer science at UC Berkeley. She is broadly interested in unstructured and AI-powered data analysis and leads the DocETL research project. She also co-teaches a hands-on LLM evaluation course for practitioners, with over 1000 participants to date.
Shreya is advised by Dr. Aditya Parameswaran. Her work appears in top data management and HCI venues like SIGMOD, VLDB, CIDR, CSCW and UIST, and she co-organizes the DEEM workshop at SIGMOD. She is supported by the NDSEG Fellowship. Prior to Berkeley, she worked as an ML engineer after completing her B.S. in computer science at Stanford University. In her free time, she enjoys roasting coffee and is actively trying to reduce her Twitter usage.
📰 News and Industry Impact
Recent News
- [July 2025] We will be presenting DocWrangler and DataScout at UIST! See you in Busan, Korea! 🇰🇷
- [May 2025] We will be presenting DocETL at VLDB! See you in London! 🇬🇧
- [March 2025] I'm teaching a course on LLM Evals in May: AI Evals For Engineers. It will be a hands-on, interactive course with real homework assignments and collaborative learning sessions!
- [Jan 2025] We released DocWrangler, an IDE for writing DocETL pipelines! Read more about it in our blog post and access DocWrangler here.
Companies That Like Our Work 👍
👨🏫 Mentorship
I am fortunate to work with many talented students at UC Berkeley. Below is a list of students I am currently mentoring or have mentored for a year or more.
Current Students
- Lindsey Wei (University of Washington undergraduate)
- Ruiqi Chen (University of Washington master's student)
- Nikhil Rao and Vinay Rao (high school students)
Past Students
- Rachel Lin (UC Berkeley master's student) - First-author paper at UIST 2025.
- Quentin Romero Lauro (University of Pittsburgh undergraduate, REU at UC Berkeley) - First-author paper under submission.
- Reya Vir (UC Berkeley undergraduate) - First-authored a publication at NAACL Now pursuing a PhD at Columbia University, with support from the NSF GRFP.
- Ankush Garg (UC Berkeley master's student)
- Parth Asawa (former UC Berkeley undergraduate) - Co-authored two publications at CIDR and VLDB. Now pursuing a PhD at UC Berkeley.
- Yujie Wang (former UC Berkeley undergraduate) - Co-authored a publication at CIDR. Joined Google after graduation.
- Aditi Mahajan (former UC Berkeley undergraduate) - Joined Google after graduation.
🗣️ Selected Invited Talks
Structured Analysis of Unstructured Data
- [July '25] Adobe Research
- [June '25] Redis Labs
DocWrangler
- [May '25] LangChain Disrupt Conference
- [April '25] SF Public Defender's Office
- [April '25] Spring EPIC Lab Retreat
- [March '25] Montreal HCI Seminar
DocETL
- [March '25] UC Berkeley BLISS Lab Seminar
- [March '25] Brown University DB Seminar
- [Feb '25] Columbia University DB Seminar
- [Feb '25] Scottish Climate Intelligence Service
- [Jan '25] Cloudera
- [Dec '24] Microsoft: Gray Systems Lab
- [Nov '24] Snowflake
- [Nov '24] ByteDance (TikTok)
- [Nov '24] Google: Systems Research Group
- [Nov '24] WInE Lab at CMU
- [Nov '24] Solventum
- [Oct '24] US Army Research Laboratory
Some Past Recordings
📬 Contact
Email: shreyashankar@berkeley.edu
Twitter |
Github