Commit 6fb3db71 authored by Greulich, Christopher's avatar Greulich, Christopher
Browse files

Add new file

parent cc823bd3
Loading
Loading
Loading
Loading

2025-CoDA/readme.md

0 → 100644
+13 −0
Original line number Diff line number Diff line
# CoDA 2025 - Conference on Data Analysis
### February 25-28, 2025 in Santa Fe, New Mexico
### Exploring Data-Focused Research across the Department of Energy

The conference website is [located here](https://web.cvent.com/event/7845571b-b15d-418c-a24a-14468480c4ff/summary). 

I presented a poster titled "Dataset and Machine Learning Methods for Elemental Chemical Separations".

## Poster:
![](CoDA_poster_final.PNG)

## Abstract:
A group at Oak Ridge national laboratory is developing an automated system that studies the performance of elemental chemical separations via extraction chromatography. Performing extraction chromatography by hand is a time-consuming process and it is of interest to speed up this process. We describe an ongoing literature survey that data mined nearly 10,000 individual separations. The database indexes 41 different resins, 2 acids, 77 elements (focusing mainly on transition metals, actinides, and lanthanides), and various acid concentrations which vary by 5 orders of magnitude. In addition to those independent variables, the data base includes a dependent variable which is the coefficient of separation efficiency or performance referred to as the distribution coefficient and stylized as K_d. This distribution coefficient is useful in designing a series of separations that maximally separate different elemental and molecular species for enhanced downstream analysis, medical treatments, use as a purified feedstock for manufacturing, as well as for many other uses.  This work focuses on the use of machine learning to estimate K_d value from the independent variables. The poster will describe the results as well as discuss some of the challenges of the datasets, such as sparsity.