Haochen Pan
Thank you for stopping by! I am a fifth-year CS Ph.D. student at the University of Chicago and a proud member of Globus Labs, advised by Dr. Kyle Chard, Dr. Ian Foster, and Dr. Ryan Chard.

Current Research
My research focuses on distributed systems at the intersection of cloud and high-performance computing (HPC), with an emphasis on resilience and efficiency for AI-guided scientific workflows and time-sensitive data analysis.
- We developed Octopus, a Kafka-based hierarchical event fabric for high-performance exchange of control and metadata events across cloud and HPC environments.
- Building on this, we designed Icicle (to be released), a real-time metadata monitoring and indexing system for Lustre and IBM Storage Scale that integrates Octopus, Apache Flink, and Globus Search to provide live visibility and historical usage analysis.
- More recently, we developed Science-MCP, which applies the Model Context Protocol (MCP) to expose these capabilities as discoverable and composable services for LLM-powered agents across heterogeneous cyberinfrastructure.
Selected Publications
The complete list is available on Google Scholar and my CV.
- [Preprint] Experiences with Model Context Protocol Servers for Science and High Performance Computing. Haochen Pan, Ryan Chard, Reid Mello, Christopher Grams, Tanjin He, Alexander Brace, Owen Price Skelly, Will Engler, Hayden Holbrook, Song Young Oh, Maxime Gonthier, Michael Papka, Ben Blaiszik, Kyle Chard, Ian Foster
- [FTXS'24] Octopus: Experiences with a Hybrid Event-Driven Architecture for Distributed Scientific Computing. Haochen Pan, Ryan Chard, Sicheng Zhou, Alok Kamatar, Rafael Vescovi, Valérie Hayot-Sasson, André Bauer, Maxime Gonthier, Kyle Chard, Ian Foster
- [FGCS Vol. 153] The Globus Compute Dataset: An Open Function-as-a-Service Dataset From the Edge to the Cloud. André Bauer, Haochen Pan, Ryan Chard, Yadu Babuji, Josh Bryan, Devesh Tiwari, Ian Foster, Kyle Chard
- [OSDI'22] Cancellation in Systems: An Empirical Study of Task Cancellation Patterns and Failures. Utsav Sethi, Haochen Pan, Shan Lu, Madanlal Musuvathi, Suman Nath
- [SOSP'21] Rabia: Simplifying State-Machine Replication Through Randomization. Haochen Pan, Jesse Tuglu, Neo Zhou, Tianshu Wang, Yicheng Shen, Xiong Zheng, Joseph Tassarotti, Lewis Tseng, Roberto Palmieri
Projects
- 2025 Science MCPs: Model Context Protocol (MCP) servers that enable AI assistants to interact with scientific computing resources and data management services.
- 2024 Diaspora: a resilience-enabling event fabric for real-time scientific workflows across HPC systems
- 2021 Rabia: a high-performance framework for state-machine replication within data centers
- 2019 Cars on the Grid: a simulator for multi-hop broadcast in vehicular ad-hoc networks (VANETs)
- 2019 NFSB: an extensible benchmarking framework for storage systems of Network Functions
- 2019 GitHub Trending Timeline: a full-stack Python application to track GitHub Trending repositories over time
- 2018 CCM: tools and scripts for automated deployment and management of Cassandra clusters
- 2017 Eagle Vision: an iOS app to notify Boston College students of open seats in courses
High-Performance Computing
- Aug 2025 [i4] [Preprint] Experiences with Model Context Protocol Servers for Science and High Performance Computing
- Aug 2025 [j4] [FHPCP] Toward a Persistent Event-Streaming System for High-Performance Computing Applications
- Jul 2025 [i3] [Preprint] Throughput Estimation of Data Transport Networks from Digital Twin Measurements
- Jul 2025 [j3] [ApJS] RADAR—Radio Afterglow Detection and AI-Driven Response: A Federated Framework for Gravitational Wave Event Follow-Up
- Jun 2025 [c22] [ICS'25] D-Rex: Heterogeneity-Aware Reliability Framework and Adaptive Algorithms for Distributed Storage
- May 2025 [c21] [CCGrid'25] DynoStore: A wide-area distribution system for the management of data over heterogeneous storage
- May 2025 [c20] [CCGrid'25] WRATH: Workload Resilience Across Task Hierarchies in Task-based Parallel Programming Frameworks
- May 2025 [c19] [IPDPS'25] Optimizing Fine-Grained Parallelism Through Dynamic Load Balancing on Multi-Socket Many-Core Systems
- Jan 2025 [i2] [Preprint] MOFA: Discovering Materials for Carbon Capture with a GenAI- and Simulation-Based Workflow
- Nov 2024 [c18] [FTXS'24] Octopus: Experiences with a Hybrid Event-Driven Architecture for Distributed Scientific Computing
- Sep 2024 [c17] [NRDPISI-1] Diaspora: Resilience-Enabling Services for Real-Time Distributed Workflows
- Sep 2024 [c15] [eScience'24] TaPS: A Performance Evaluation Suite for Task-based Execution Frameworks
Cloud Computing
- Sep 2024 [c16] [eScience'24] An Empirical Investigation of Container Building Strategies and Warm Times to Reduce Cold Starts in Scientific Computing Serverless Functions
- Apr 2024 [j2] [FGCS Vol. 153] The Globus Compute Dataset: An Open Function-as-a-Service Dataset From the Edge to the Cloud
- Dec 2020 [c10] [GLOBECOM'20] BBB: A Lightweight Approach to Evaluate Private Blockchains in Clouds
- Mar 2020 [c7] [DUCSAN'20] Tutorial: Google Cloud for Beginners: Architecture, Storage, and Computation
- Dec 2019 [c3] [PRDC'19] BBB: Make Benchmarking Blockchains Configurable and Extensible
Distributed Systems
- Jul 2022 [c14] [OSDI'22] Cancellation in Systems: An Empirical Study of Task Cancellation Patterns and Failures
- May 2022 [c13] [ICC'22] Reliable Broadcast in Critical Applications: Asset Transfer and Smart Home
- Oct 2021 [c12] [SOSP'21] Rabia: Simplifying State-Machine Replication Through Randomization
- Jan 2021 [c11] [ICDCN'21] Practical Experience Report: Cassandra+: Trading-Off Consistency, Latency, and Fault-tolerance in Cassandra
- Dec 2020 [j1] [Computer Networks Vol. 182] Reliable broadcast with trusted nodes: Energy reduction, resilience, and speed
- Nov 2020 [c9] [NCA'20] CassandrEAS: Highly Available and Storage-Efficient Distributed Key-Value Store with Erasure Coding
- Jul 2020 [i1] [Preprint] Reliable Broadcast in Practical Networks: Algorithm and Evaluation
- Mar 2020 [c8] [PerVehicle'20] Make Multi-hop Broadcast in VANET Fast by Selecting a Better Route for Source Vehicle
- Mar 2020 [c6] [DUCSAN'20] Tutorial: Deep Dive into Apache Cassandra: Theory, Design, and Application
- Mar 2020 [c5] [DUCSAN'20] LiteDoc: Make Collaborative Editing Fast, Scalable, and Robust
- Dec 2019 [c2] [NCA'19] Distributed Causal Memory in the Presence of Byzantine Servers
- Sep 2019 [c1] [Sarnoff'19] A First Step Towards Production-Ready Network Function Storage: Benchmarking with NFSB