Support to Convert .fit Results to CSV (or any format) by kevScheuer · Pull Request #393 · JeffersonLab/halld_sim

kevScheuer · 2026-06-03T12:59:29Z

This request is to merge a script and set of classes that will allow any Amptools-based analysis to convert their .fit results into a comma-separated value (CSV) file. Several plotters already exist for analyzing fit results per bin, and these are very well suited for analyzing the angular distributions, but mass-independent fits must "stitch" together their fit results to observe any behavior of the amplitudes and phases across mass bins. In addition, the 100s of fit results produced by bootstrap or randomized fits have no standard way to be aggregated. This CSV converter is designed to fill this gap in the analysis process. Below I've provided a short description for each component added.

`convert_to_csv`

This is the primary script that users will interact with. A user with several fit results result_1.fit, result_2.fit... can simply execute

user@ifarm:~$ convert_to_csv -i dir/result_*.fit

and a CSV will be made where each row corresponds to the .fit file, and the columns indicate AmpTools fit outputs, parameters, intensities, and phase differences.

This CSV can then be read into a Python Pandas dataframe, ROOT tree or dataframe, or used by practically any programming language, and then plotted. The script is designed to be as generic as possible, so that any AmpTools-based analysis can use it. Listed are some more highlighted features of the script

Info about the data (number of events, t_bin info, mass bin info, etc.) the fit was run on is extracted with the --data-file flag. It will read the associated data (with optional weights and/or background) files of the result and extract the info to a CSV file
- Different reactions are supported via the --lower-vertex-indices flag. This tells the ROOTDataConverter which 4-vector indices correspond to the upper or lower vertex, thus allowing the correct calculation of the mass and $-t$ info
  - By default, the reaction is assumed to simply be a recoil proton
Can produce covariance, correlation, and normalization integral matrices
Identifies coherent sums according to the amplitude naming scheme, which can be explicitly set via --naming-scheme
- See AmplitudeParser for more details

`FitConverter`

Handles the .fit -> .csv conversion. This class stores:

Standard fit outputs (likelihood, events, status codes)
Parameters
Production Coefficients
Intensities of unique amplitudes (see here for more explanation)
Coherent sums of amplitudes by quantum number (see AmplitudeParser below)
Phase Differences between amplitudes

Currently supports .fit -> .csv conversion, but can easily be expanded to any file format desired. This is because all the results of interest are stored in various maps, and so writing to CSV is as easy as iterating over the maps.

`ROOTDataConverter`

This class is responsible for extracting the PWA-related information from a ROOT file. It stores:

Bin edges, centers, averages, and RMS values for $-t$, beam energy, and upper vertex masses
Number of events and detector efficiency

Just like the FitConverter, any file format beyond CSV can be used. To get the info, the class uses the data and monte carlo files associated with the fit. If available, it also properly incorporates event weights or background files. As discussed above, to calculate the mass and $-t$ info, the user specifies the 4-vector indices.

`AmplitudeParser`

This was the biggest hurdle for generalizing the converter. A lot of times we are not just interested in the individual amplitudes and phases, but their (in)coherent sums, like "total reflectivity contribution" or "behavior of JL waves summed over the spin-projections". The problem is that these sums are typically defined manually, because the amplitudes (and thus their quantum numbers) are user defined. The only way to identify them for grouping is by identifying the naming scheme of the amplitude, but not everyone uses the same scheme.

This class tries to identify the amplitude naming scheme used, and defines a set of possible sums based off the quantum numbers given in the scheme. It currently supports:

JLme - the current recommended generic format
eJPmL - used for some vector-pseudosalar analyses
Lme - common scheme for 2-pseudoscalar analysis
but can be easily extended to other schemes by users.

Updates from previous version

For those using the older standalone version of this script shown in the last tutorial, I figure its worth it to list some key differences:

Fast - pure C++, instead of the clunky python -> subprocess -> ROOT interpreter being done before
Easy Start - no longer have to setup python envs and ROOT paths, it's all immediately available in halld_sim now
Generalized - data files don't have to be separately called, and all types of analyses should be supported now.

Converter directory now reflects that any other data converters may be added in the future, not just CSV.

The parameters are now saved with their errors. The verbose flag now controls the amount of output during processing.

Was requiring that amplitudes with common amp names in "reaction::sum::ampName" format be constrained to each other. Now it will save the mapping for unique amplitude groups, e.g. "ampName", "sum::ampName", or the full "reaction::sum::ampName" strings.

Files are accessed so many times it makes more sense to save them. File loading happens in the constructor now. Also added a background file bool for easy tracking of whether or not the background files are present. A template for getting the -t values is also added, but not yet implemented. This will also effect how the other distributions are handled.

The largest addition is a function that extracts the values of interest for the beam energy, which incorporates signal and background subtraction. To help with this, a min/max finder function was added to find a common min/max value for a branch across files. A few other report lines were added, and some fixes to compile properly.

Uses a RDataFrame method to compute t from the various 4-vector component branches, then fills a histogram with the t values. If background files are present, also computes a background histogram and subtracts it from the data histogram before calculating statistics. Aside from this, small reports and comments were added.

Removed the mass-branch arg, as the mass can be calculated from the labeled 4-vectors. The indices can now be set by the user. Aside from this, the files have been formatted.

Having the functions return the created histogram makes it: 1. Easier to understand the purpose of the function, and doesn't hide the map filling in the implementation 2. Allows for possibility of printing the hist for debugging purposes Also added a function to return the total number of events and its error

In order to save the coherent sums, a new AmplitudeParser class was created to parse the amplitude names and categorize them into groups based on the quantum numbers they contain. This relies on known "naming schemes" for the amplitudes. Currently the most common schemes are supported, with instructions for how to add new schemes.

Also added a quick method to get the reaction string, which was helpful for the normInt functions. This commit also includes some formatting.

gluex · 2026-06-03T13:40:35Z

Test status for this pull request: SUCCESS

Summary: /work/halld/pull_request_test/halld_sim^csv_converter/tests/summary.txt
Logs: /work/halld/pull_request_test/halld_sim^csv_converter/tests/log

Build log: /work/halld/pull_request_test/halld_sim^csv_converter/make_csv_converter.log
Build report: /work/halld/pull_request_test/halld_sim^csv_converter/report_csv_converter.txt
Location of build: /work/halld/pull_request_test/halld_sim^csv_converter

gluex · 2026-06-21T17:45:19Z

Test status for this pull request: SUCCESS

Summary: summary.txt
Logs: results/log

Build log: make_csv_converter.log
Build report: report_csv_converter.txt

lihaoahil · 2026-07-03T21:07:29Z

Thanks for developing this converter — I tested it on a few fit outputs and it works for standard partial-wave-style fits where the amplitude naming follows the usual expected structure.

I did run into a crash for an SDME fit with a slightly different but still valid AmpTools configuration, where the intensity function has multiple factors in one sum. This is different from the mass-independent PWA-style fits that I tested successfully, though similar structures may also appear in some mass-dependent PWA fits. In my case the .fit file contains amplitudes like

amplitude twoPS0::SDMEext::kstar KStarHyperonExtended ...
amplitude twoPS0::SDMEext::kstar BreitWigner ...

The crash happens in FitConverter::sumAmpNamesAreConstrained(), where unique_sum_amp is assumed to contain "::" before being split into sum and amp. With my fit, the debug output shows

DEBUG sumAmpNamesAreConstrained: unique_sum_amp = SDMEext, sum = SDMEext, amp = DMEext, shared_terms.size() = 0

So unique_sum_amp.find("::") returns npos, and then substr(find("::") + 2) produces a malformed amplitude name. The following termList("", sum, amp) returns an empty vector, and the code later accesses shared_terms[0], which causes the segfault.

I think this is mostly a robustness issue rather than a problem with the general converter logic. It would be helpful if the converter could fall back to full amplitude names when the shortened naming scheme cannot be inferred reliably, rather than crashing.

I have made the test .fit files for both PWA and doubleSDME and related input available here: /w/halld-scshelf2101/home/haoli/analysis/testIO_kpi/test_Kevin

kevScheuer added 30 commits January 30, 2026 14:09

initial commit for converter script

bae87e6

add needed sconscript files

afeb6f4

added cli args and helper functions

dc30e7c

Modify sconscript files and rename converter dir

b970ad7

Converter directory now reflects that any other data converters may be added in the future, not just CSV.

revert so binary properly created

8b012b0

Basic FitConverter functionality completed

5766b6f

Added CSV output

09c467f

The parameters are now saved with their errors. The verbose flag now controls the amount of output during processing.

Add cov/corr matrix conversions

b268a0d

basic structure of root data converter done

c1981a3

minor todo edit

1db3e51

Improved some comments and variable names

7000311

added comments

ac20908

add some basics for rootDataConverter in csv script

0a6ce6d

removed deprecated header file

0428e9b

added todo

5cbccc0

added some todos and cleaned up comments

91f04a0

made function for determining upper vertex indices

52b6b97

Added a function for calculating upper vertex mass

17a2557

Lower vertex indices have been fully implemented

fc3472b

Removed the mass-branch arg, as the mass can be calculated from the labeled 4-vectors. The indices can now be set by the user. Aside from this, the files have been formatted.

updated comment and added error check

2c63a7f

Added efficiency & acc-correction + formatting

5d4c8b7

Added CSV methods to data converter

1de878c

added usage example

d1c8696

add the naming_scheme, and some comments

fe35b58

kevScheuer added 9 commits May 7, 2026 22:57

changed cout and cerr to report

b03c845

Added way to get normalization integral matrix

5747436

Also added a quick method to get the reaction string, which was helpful for the normInt functions. This commit also includes some formatting.

updated docstring

920ddab

docstring update

c083b54

Merge branch 'master' into csv_converter

04867c9

Updated dates and some comments

cc843c4

removed forgotten cout line

904cd5d

fixed function description

958d162

Add estDistToMinimum to standard output

ad6bf3d

Merge branch 'master' into csv_converter

03189d5

kevScheuer requested review from amschertz, edbarriga, harsimranhs and lihaoahil and removed request for edbarriga June 30, 2026 14:25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Support to Convert .fit Results to CSV (or any format)#393

Support to Convert .fit Results to CSV (or any format)#393
kevScheuer wants to merge 40 commits into
masterfrom
csv_converter

kevScheuer commented Jun 3, 2026

Uh oh!

gluex commented Jun 3, 2026

Uh oh!

gluex commented Jun 21, 2026

Uh oh!

lihaoahil commented Jul 3, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

kevScheuer commented Jun 3, 2026

convert_to_csv

FitConverter

ROOTDataConverter

AmplitudeParser

Updates from previous version

Uh oh!

gluex commented Jun 3, 2026

Uh oh!

gluex commented Jun 21, 2026

Uh oh!

lihaoahil commented Jul 3, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

`convert_to_csv`

`FitConverter`

`ROOTDataConverter`

`AmplitudeParser`