Prashant Pandey

Prashant Pandey

Research Scientist

VMware Research


I will join the School of computing at the University of Utah as an Assistant Professor in Fall 2022. I’m looking for self-motivated students. Drop me a note if you want to do cool research, run/hike in the mountains, and enjoy the world famous skii in Salt Lake City.

I am a Research Scientist at VMware Research. My goal as a researcher is to advance the theory and practice of resource-efficient data structures and employ them to democratize complex and large-scale data analyses. I have worked on designing and building tools for large-scale data management problems across computational biology, stream processing, and storage.

Previously, I did a postdoc at UC Berkeley working with Prof. Aydin Buluc and Prof. Katherine Yelick. Prior to that, I spent one year as a Postdoc at Carnegie Mellon University working with Prof. Carl Kingsford. I obtained my Ph.D. in Computer Science at Stony Brook University, and defended my dissertation, Fast and Space-Efficient Maps: Shrinking Big Data Down to Size. At Stony brook University, I was co-advised by Prof. Michael Bender and Prof. Rob Johnson. (Dissertation committee: Mike Ferdman, Rob Patro, Guy Blelloch.)

I love outdoors and occasionally like to scribble my experience in a blog.

  • Data Structures for Big Data
  • Graphs Processing
  • Computational Biology
  • Scalable Graph Neural Networks
  • PhD in Computer Science, 2018

    Stony Brook University

Recent & Upcoming Talks

Data Systems at Scale: Scaling Up by Scaling Down and Out
Time to Change Your Filter
Data Systems at Scale: Scaling Up by Scaling Down and Out

Recent Publications

(2022). Using Advanced Data Structures to Enable Responsive Security Monitoring. Cluster Computing 2022.

PDF Cite Project

(2022). An Incrementally Updatable and Scalable System for Large-Scale Sequence Search using the Bentley-Saxe Transformation. BIOINFORMATICS 2022.

PDF Cite Code Project

(2021). VariantStore: an index for large-scale genomic variant search. Genome Biology 2021.

PDF Cite Code Project Video

(2021). Terrace: A Hierarchical Graph Container for Skewed Dynamic Graphs. SIGMOD 2021.

PDF Cite Code Project Video

(2021). External-Memory Dictionaries in the Affine and PDAM Models. TOPC 2021.

PDF Cite Project

Academic Service


  • PC: IEEE BigData 2022, ACM BCB 2022, APOCS 2022, IPDPS 2022
  • External Reviewer: FAST 2022
  • Oxford BIOINFORMATICS (2022)
  • Journal of Computational Biology (2022)
  • Transactions on Knowledge and Data Engineering (TKDE) (2022)


  • PC: ACDA 2021, RECOMB-Seq 2021, IPDPS 2021, ALENEX 2021
  • Subreviewer: ISMB 2021, STACS 2021, HPEC 2021
  • Session Chair: ALENEX 2021
  • Journal of Computational Biology (2021)
  • Transactions on Knowledge and Data Engineering (TKDE) (2021)
  • IEEE Access (2021)


  • PC: EUROPAR 2020, RECOMB-Seq 2020
  • Subreviewer: RECOMB 2020
  • Oxford BIOINFORMATICS (2020)
  • Transactions on Parallel and Distributed Systems (TPDS) (2020)


  • PC: ESA 2019
  • Subreviewer: WABI 2019, CIAC 2019
  • IEEE Access (2019)
  • Oxford BIOINFORMATICS (2019)
  • Journal of Experimental Algorithms (JEA) (2019)


  • Oxford BIOINFORMATICS (2018)
  • Transactions on Databases (TODS) (2018)