Data-Intensive Discovery Accelerated by Computational Techniques for Science (DIDACTS)

How do we realize the full impact of machine learning in the physical sciences? That is the core question that the DIDACTS project centers around. The specific challenge is that physical sciences are at a tipping point whereas current machine learning methods do not adequately address their needs.

Research information

Quad Chart description of project

(Click here for a PDF of the above Quad Chart)

As basic sciences including seismology, meteorology, materials science, and others, are becoming data intensive, we are reaching a tipping point, where identifying and utilizing meaningful signals in noisy data become increasingly difficult and require innovative, effective computational methods. A particularly challenging problem in the physics domain is the identification of Dark Matter (and properties of other elementary particles). Eighty-five percent of our Universe comprises something that we do not understand: Dark Matter. It binds the whole Universe together; without it, galaxies would not form and life would not exist. Yet we have no experimental knowledge of its properties.

Astroparticle physics experiments, characterized by massive amounts of extremely noisy spatiotemporal sensor data, provide an ideal development ground for computational methods that will be broadly applicable in experimental sciences. Our earlier studies have demonstrated that current neural network- based methods are not a good fit for encoding prior physical knowledge, and for faithfully representing physical constraints in these experiments. We propose going back to foundational probabilistic modeling and inverse problems, into which physical constraints and prior knowledge can be readily incorporated. This proposal brings together a well-qualified interdisciplinary team, comprising an astrophysicist working on particle physics detectors, a machine learning researcher with much interdisciplinary collaborative experience in the sciences, and an engineer with extensive expertise in signal processing, statistics. and inverse problems. The team is thus ideally positioned to make significant contributions advancing both data science and its impact on data-driven scientific disciplines.

Deliverables

Most of our non-paper content can be found in our Zenodo community, including anything below that isn’t directly citable. Publications and codes will be linked here once available. Repositories will be within our Github organization.

Papers

A Method for Quantifying Position Reconstruction Uncertainty in Astroparticle Physics using Bayesian Networks
arXiv:2205.10305
Domain-informed neural networks for interaction localization within astroparticle experiments
arXiv:2112.07995 Front. Artif. Intell. 5, 832909 (2022)

Highlighted talks at workshops

Quantifying Reconstruction Uncertainty using Probabilistic Graphical Models at Community Laboratory for AL Research at the Intersection with Physics (CLARIPHY) Topical Meeting talk
DIDACTS talk at Neutrino Physics and Machine Learning talk, you can also find a recorded video here
GNN reconstruction progress talk at DANCE-ML 2020 talk

Other deliverables

Learning Product Graphs Underlying Smooth Graph Signals, arXiv:2002.11277
Sensitivity of the DARWIN observatory to the neutrinoless double beta decay of 136Xe, arXiv:2003.13407
Processing of Graph Structured Data: An Introduction
SNEWS 2.0 Cyberinfrastructure Agile Scrum Development in an ad hoc Software Collaboration
SIAM CSE21 March 2021: Graph Convolutional Neural Networks for Position Reconstruction in XENON1T

Posters

NSF_poster (Click here for a PDF of the above poster)

Project leads

Please feel free to reach out to any of our PIs to discuss possible collaboration.

PI Christopher Tunnell, Rice University, Physics and Astronomy (more info)
Co-PI Waheed Bajwa, Rutgers, Electrical and Computing Engineering / Statistics (more info)
Co-PI Hagit Shatkay, U. Delaware, Computer and Information Sciences (more info)

Team

Aaron Higuera, Rice University, Physics and Astronomy
Shixiao Liang, Rice University, Physics and Astronomy
Tina Peters, U. Delaware, Computer and Information Sciences
Venkat Roy, Rutgers, Electrical and Computing Engineering / Statistics

Workshops

A core part of our work is engaging with communities within the physical sciences that face similar challenges through a series of workshops each year. So far, we have run the following workshops:

DANCE-ML 2020 a forum to share and discuss machine learning applications in the context of the dark matter direct detection and neutrino physics community.
DANCE 2019 focused on computational challenges to neutrino and dark-matter science

Press

Project start
- Delaware
- Rice
- Rutgers
- KnowInnovation

Funding

This work is supported by the National Science Foundation as part of it’s Harnessing the Data Revolution Big Idea (Soliciation 19-543) through awards 1940074, 1940209, and 1940080.