[Pingchuan :)]
Pingchuan Ma
Ph.D Student, ASU
Email: pingchua (at) asu.edu
Biography
Pingchuan Ma (馬平川) is a third-year Ph.D Student at ASU, advised by Prof. Huan Liu in DMML. His current research interests include causal machine learning. Previously, he obtained his master of science degree from USC and worked as an analog engineer at Renesas. Before that, he received his undergraduate degree from ShanghaiTech University. He had the privilege to work with Prof. Jiaqi Gu before. [...]
Name

Family name: () means horse.

Given name: (Píng)(chuān) means flat and open land without geographical barriers.

Quoted from a Chinese idiom 一馬平川, which depicts a lone horse galloping freely across a vast, unobstructed plain. It symbolizes unimpeded progress and a life journey free of obstacles.

眾峰來自天目山,勢若駿平川

There are lots of mountains from Tianmu Mountain, which have the might of fine horses galloping across flat ground.

Selected Research
(* indicates equal contribution, highlight indicates representative papers, full publication list can be found in Scholar)

Abstract: Crisis classification in social media aims to extract actionable disaster-related information from multimodal posts, which is a crucial task for enhancing situational awareness and facilitating timely emergency responses. However, the wide variation in crisis types makes achieving generalizable performance across unseen disasters a persistent challenge. Existing approaches primarily leverage deep learning to fuse textual and visual cues for crisis classification, achieving numerically plausible results under in-domain settings. However, they exhibit poor generalization across unseen crisis types because they 1. do not disentangle spurious and causal features, resulting in performance degradation under domain shift, and 2. fail to align heterogeneous modality representations within a shared space, which hinders the direct adaptation of established single-modality domain generalization (DG) techniques to the multimodal setting. To address these issues, we introduce a causality-guided multimodal domain generalization (MMDG) framework that combines adversarial disentanglement with unified representation learning for crisis classification. The adversarial objective encourages the model to disentangle and focus on domain-invariant causal features, leading to more generalizable classifications grounded in stable causal mechanisms. The unified representation aligns features from different modalities within a shared latent space, enabling single-modality DG strategies to be seamlessly extended to multimodal learning. Experiments on the different datasets demonstrate that our approach achieves the best performance in unseen disaster scenarios.


  @misc{ma2025camocausalityguidedadversarialmultimodal,
      title={CAMO: Causality-Guided Adversarial Multimodal Domain Generalization for Crisis Classification}, 
      author={Pingchuan Ma and Chengshuai Zhao and Bohan Jiang and Saketh Vishnubhatla and Ujun Jeong and Alimohammad Beigi and Adrienne Raglin and Huan Liu},
      year={2025},
      eprint={2512.08071},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2512.08071}, 
}

Abstract: Chain-of-Thought (CoT) prompting has been shown to improve Large Language Model (LLM) performance on various tasks. With this approach, LLMs appear to produce human-like reasoning steps before providing answers (a.k.a., CoT reasoning), which often leads to the perception that they engage in deliberate inferential processes. However, some initial findings suggest that CoT reasoning may be more superficial than it appears, motivating us to explore further. In this paper, we study CoT reasoning via a data distribution lens and investigate if CoT reasoning reflects a structured inductive bias learned from in-distribution data, allowing the model to conditionally generate reasoning paths that approximate those seen during training. Thus, its effectiveness is fundamentally bounded by the degree of distribution discrepancy between the training data and the test queries. With this lens, we dissect CoT reasoning via three dimensions: task, length, and format. To investigate each dimension, we design DataAlchemy, an isolated and controlled environment to train LLMs from scratch and systematically probe them under various distribution conditions. Our results reveal that CoT reasoning is a brittle mirage that vanishes when it is pushed beyond training distributions. This work offers a deeper understanding of why and when CoT reasoning fails, emphasizing the ongoing challenge of achieving genuine and generalizable reasoning.

To be released upon acceptance.

Abstract: The finite-difference time-domain (FDTD) method, which is important in photonic hardware design flow, is widely adopted to solve time-domain Maxwell equations. However, FDTD is known for its prohibitive runtime cost, taking minutes to hours to simulate a single device. Recently, AI has been applied to realize orders-of-magnitude speedup in partial differential equation (PDE) solving. However, AI-based FDTD solvers for photonic devices have not been clearly formulated. Directly applying off-the-shelf models to predict the optical field dynamics shows unsatisfying fidelity and efficiency since the model primitives are agnostic to the unique physical properties of Maxwell equations and lack algorithmic customization. In this work, we thoroughly investigate the synergy between neural operator designs and the physical property of Maxwell equations and introduce a physics-inspired AI-based FDTD prediction framework PIC2O-Sim which features a causality-aware dynamic convolutional neural operator as its backbone model that honors the space-time causality constraints via careful receptive field configuration and explicitly captures the permittivity-dependent light propagation behavior via an efficient dynamic convolution operator. Meanwhile, we explore the trade-offs among prediction scalability, fidelity, and efficiency via a multi-stage partitioned time-bundling technique in autoregressive prediction. Multiple key techniques have been introduced to mitigate iterative error accumulation while maintaining efficiency advantages during autoregressive field prediction. Extensive evaluations on three challenging photonic device simulation tasks have shown the superiority of our PIC2O-Sim method, showing 51.2% lower roll-out prediction error, 23.5 times fewer parameters than state-of-the-art neural operators, providing 300-600x higher simulation speed than an open-source FDTD numerical solver.

@article{10.1063/5.0242728,
  author = {Ma, Pingchuan and Yang, Haoyu and Gao, Zhengqi and Boning, Duane S. and Gu, Jiaqi},
  title = {PIC2O-Sim: A physics-inspired causality-aware dynamic convolutional neural operator for ultra-fast photonic device time-domain simulation},
  journal = {APL Photonics},
  volume = {10},
  number = {3},
  pages = {036104},
  year = {2025},
  month = {03},
  abstract = {Optical simulation plays an important role in photonic hardware design flow. The finite-difference time-domain (FDTD) method is widely adopted to solve time-domain Maxwell equations. However, FDTD is known for its prohibitive runtime cost as it iteratively solves Maxwell equations and takes minutes to hours to simulate a single device. Recently, AI has been applied to realize orders-of-magnitude speedup in partial differential equation solving. However, AI-based FDTD solvers for photonic devices have not been clearly formulated. Directly applying off-the-shelf models to predict the optical field dynamics shows unsatisfying fidelity and efficiency since the model primitives are agnostic to the unique physical properties of Maxwell equations and lack algorithmic customization. In this work, we thoroughly investigate the synergy between neural operator designs and the physical property of Maxwell equations and introduce a physics-inspired AI-based FDTD prediction framework PIC2O-Sim. PIC2O-Sim features a causality-aware dynamic convolutional neural operator as its backbone model that honors the space–time causality constraints via careful receptive field configuration and explicitly captures the permittivity-dependent light propagation behavior via an efficient dynamic convolution operator. Meanwhile, we explore the trade-offs among prediction scalability, fidelity, and efficiency via a multi-stage partitioned time-bundling technique in autoregressive prediction. Multiple key techniques have been introduced to mitigate iterative error accumulation while maintaining efficiency advantages during autoregressive field prediction. Extensive evaluations on three challenging photonic device simulation tasks have shown the superiority of our PIC2O-Sim method, showing 51.2\% lower roll-out prediction error, 23.5 times fewer parameters than state-of-the-art neural operators, providing 133–310× or 31–89× higher simulation speed than an open-source single-process or eight-process parallel FDTD numerical solver.},
  issn = {2378-0967},
  doi = {10.1063/5.0242728},
  url = {https://doi.org/10.1063/5.0242728},
  eprint = {https://pubs.aip.org/aip/app/article-pdf/doi/10.1063/5.0242728/20420133/036104\_1\_5.0242728.pdf},
}

Abstract: Nanophotonic device design aims to optimize photonic structures to meet specific requirements across various applications. Inverse design has unlocked non-intuitive, high-dimensional design spaces, enabling the discovery of high-performance devices beyond heuristic or analytic methods. The adjoint method, which calculates gradients for all variables using just two simulations, enables efficient navigation of this complex space. However, many inverse-designed structures, while numerically plausible, are difficult to fabricate and sensitive to variations, limiting their practical use. The discrete nature with numerous local-optimal structures also pose significant optimization challenges, often causing gradient-based methods to converge on suboptimal designs. In this work, we formulate inverse design as a fabrication-restricted, discrete, probabilistic optimization problem and introduce BOSON-1, an end-to-end, variation-aware subspace optimization framework to address the challenges of manufacturability, robustness, and optimizability. To overcome optimization difficulty, we propose dense target-enhanced gradient flows to mitigate misleading local optima and introduce a conditional subspace optimization strategy to create high-dimensional tunnels to escape local optima. Furthermore, we significantly reduce the runtime associated with optimizing across exponential variation samples through an adaptive sampling-based robust optimization, ensuring both efficiency and variation robustness. On three representative photonic device benchmarks, our proposed inverse design methodology BOSON-1 delivers fabricable structures and achieves the best convergence and performance under realistic variations, outperforming prior arts with 74.3% post-fabrication performance.

@INPROCEEDINGS{10993227,
  author={Ma, Pingchuan and Gao, Zhengqi and Begovic, Amir and Zhang, Meng and Yang, Haoyu and Ren, Haoxing and Huang, Rena and Boning, Duane S. and Gu, Jiaqi},
  booktitle={2025 Design, Automation & Test in Europe Conference (DATE)}, 
  title={BOSON−1: Understanding and Enabling Physically-Robust Photonic Inverse Design with Adaptive Variation-Aware Subspace Optimization}, 
  year={2025},
  volume={},
  number={},
  pages={1-7},
  keywords={Performance evaluation;Optical losses;Runtime;Navigation;Optimization methods;Robustness;Topology;Numerical models;Optimization;Photonics},
  doi={10.23919/DATE64628.2025.10993227}}

Abstract: Inverse design has emerged as a transformative approach for photonic device optimization, enabling the exploration of high-dimensional, non-intuitive design spaces to create ultra-compact devices and advance photonic integrated circuits (PICs) in computing and interconnects. However, practical challenges, such as suboptimal device performance, limited manufacturability, high sensitivity to variations, computational inefficiency, and lack of interpretability, have hindered its adoption in commercial hardware. Recent advancements in AI-assisted photonic simulation and design offer transformative potential, accelerating simulations and design generation by orders of magnitude over traditional numerical methods. Despite these breakthroughs, the lack of an open-source, standardized infrastructure and evaluation benchmark limits accessibility and cross-disciplinary collaboration. To address this, we introduce MAPS, a multi-fidelity AI-augmented photonic simulation and inverse design infrastructure designed to bridge this gap. MAPS features three synergistic components: (1) MAPS-Data: A dataset acquisition framework for generating multi-fidelity, richly labeled devices, providing high-quality data for AI-for-optics research. (2) MAPS-Train: A flexible AI-for-photonics training framework offering a hierarchical data loading pipeline, customizable model construction, support for data- and physics-driven losses, and comprehensive evaluations. (3) MAPS-InvDes: An advanced adjoint inverse design toolkit that abstracts complex physics but exposes flexible optimization steps, integrates pre-trained AI models, and incorporates fabrication variation models. This infrastructure MAPS provides a unified, open-source platform for developing, benchmarking, and advancing AI-assisted photonic design workflows, accelerating innovation in photonic hardware optimization and scientific machine learning.

@INPROCEEDINGS{10993033,
  author={Mal, Pingchuan and Gao, Zhengqi and Zhang, Meng and Yang, Haoyu and Ren, Mark and Huang, Rena and Boning, Duane S. and Gu, Jiaqi},
  booktitle={2025 Design, Automation & Test in Europe Conference (DATE)}, 
  title={MAPS: Multi-Fidelity AI-Augmented Photonic Simulation and Inverse Design Infrastructure}, 
  year={2025},
  volume={},
  number={},
  pages={1-6},
  keywords={Training;Technological innovation;Sensitivity;Pipelines;Machine learning;Data models;Hardware;Optimization;Photonics;Load modeling},
  doi={10.23919/DATE64628.2025.10993033}}

Abstract: DONNs leverage light propagation for efficient analog AI and signal processing. Advances in nanophotonic fabrication and metasurface-based wavefront engineering have opened new pathways to realize high-capacity DONNs across various spectral regimes. Training such DONN systems to determine the metasurface structures remains challenging. Heuristic methods are fast but oversimplify metasurfaces modulation, often resulting in physically unrealizable designs and significant performance degradation. Simulation-in-the-loop optimizes implementable metasurfaces via adjoint methods, but is computationally prohibitive and unscalable. To address these limitations, we propose SP2RINT, a spatially decoupled, progressive training framework that formulates DONN training as a PDE-constrained learning problem. Metasurface responses are first relaxed into freely trainable transfer matrices with a banded structure. We then progressively enforce physical constraints by alternating between transfer matrix training and adjoint-based inverse design, avoiding per-iteration PDE solves while ensuring final physical realizability. To further reduce runtime, we introduce a physics-inspired, spatially decoupled inverse design strategy based on the natural locality of field interactions. This approach partitions the metasurface into independently solvable patches, enabling scalable and parallel inverse design with system-level calibration. Evaluated across diverse DONN training tasks, SP2RINT achieves digital-comparable accuracy while being 1825 times faster than simulation-in-the-loop approaches. By bridging the gap between abstract DONN models and implementable photonic hardware, SP2RINT enables scalable, high-performance training of physically realizable meta-optical neural systems.

@misc{ma2025sp2rintspatiallydecoupledphysicsinspiredprogressive,
  title={SP2RINT: Spatially-Decoupled Physics-Inspired Progressive Inverse Optimization for Scalable, PDE-Constrained Meta-Optical Neural Network Training}, 
  author={Pingchuan Ma and Ziang Yin and Qi Jing and Zhengqi Gao and Nicholas Gangi and Boyang Zhang and Tsung-Wei Huang and Zhaoran Huang and Duane S. Boning and Yu Yao and Jiaqi Gu},
  year={2025},
  eprint={2505.18377},
  archivePrefix={arXiv},
  primaryClass={physics.optics},
  url={https://arxiv.org/pdf/2505.18377}, 
}
Ziyang Jiang, Pingchuan Ma, Meng Zhang, Rena Huang, and Jiaqi Gu
Key Words: Hardware Architecture Search (HAS), Genetic Algorithm (GA), Photonic Tensor Cores

Abstract: Photonic tensor cores (PTCs) are essential building blocks for optical artificial intelligence (AI) accelerators based on programmable photonic integrated circuits. Most PTC designs today are manually constructed, with low design efficiency and unsatisfying solution quality. This makes it challenging to meet various hardware specifications and keep up with rapidly evolving AI applications. Prior work has explored gradient-based methods to learn a good PTC structure differentiably. However, it suffers from slow training speed and optimization difficulty when handling multiple non-differentiable objectives and constraints. Therefore, in this work, we propose a more flexible and efficient zero-shot multi-objective evolutionary topology search framework ADEPT-Z that explores Pareto-optimal PTC designs with advanced devices in a larger search space. Multiple objectives can be co-optimized while honoring complicated hardware constraints. With only less than 3 hours of search, we can obtain tens of diverse Pareto-optimal solutions, 100x faster than the prior gradient-based method, outperforming prior manual designs with 2x higher accuracy weighted area-energy efficiency.

@inbook{10.1145/3658617.3697708,
author = {Jiang, Ziyang and Ma, Pingchuan and Zhang, Meng and Huang, Rena and Gu, Jiaqi},
title = {ADEPT-Z: Zero-Shot Automated Circuit Topology Search for Pareto-Optimal Photonic Tensor Cores},
year = {2025},
isbn = {9798400706356},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
url = {https://doi.org/10.1145/3658617.3697708},
abstract = {Photonic tensor cores (PTCs) are essential building blocks for optical artificial intelligence (AI) accelerators based on programmable photonic integrated circuits. Most PTC designs today are manually constructed, with low design efficiency and unsatisfying solution quality. This makes it challenging to meet various hardware specifications and keep up with rapidly evolving AI applications. Prior work has explored gradient-based methods to learn a good PTC structure differentiably. However, it suffers from slow training speed and optimization difficulty when handling multiple non-differentiable objectives and constraints. Therefore, in this work, we propose a more flexible and efficient zero-shot multi-objective evolutionary topology search framework ADEPT-Z that explores Pareto-optimal PTC designs with advanced devices in a larger search space. Multiple objectives can be co-optimized while honoring complicated hardware constraints. With only <3 hours of search, we can obtain tens of diverse Pareto-optimal solutions, 100\texttimes{} faster than the prior gradient-based method, outperforming prior manual designs with 2\texttimes{} higher accuracy weighted area-energy efficiency. The code of ADEPT-Z is available at link.},
booktitle = {Proceedings of the 30th Asia and South Pacific Design Automation Conference},
pages = {1077–1083},
numpages = {7}
}
Education
Ph.D. in Computer Engineering
Jan. 2024 - Present
Tempe, AZ
M.Sc. in Electrical Engineering
Jan. 2021 - Dec. 2022
Los Angeles, CA
B.Eng. in Electrical Engineering
Sep. 2016 - Jun. 2020
Shanghai, China
Experience
Graduate Research Assistant
Aug. 2025 - Present
Tempe, Az
Research Assistant | With Prof. Jiaqi Gu
Jan. 2024 - Aug. 2025
Tempe, AZ
Analog Engineer
Jan. 2023 - Dec. 2023
San Jose, CA
Research Assistant | With Prof. Baile Chen
Sept. 2018 - Jun. 2020
Shanghai, China