[io] don't miss writing a histogram that is only in a last file with option -n 2 #18679

ferdymercury · 2025-05-09T16:47:24Z

This Pull request:

Changes or fixes:

Fixes #9022

Explores solution suggested by @jblomer

fyi @will-cern

Checklist:

tested changes locally
updated the docs (if necessary)

…option -n 2 Fixes root-project#9022

github-actions · 2025-05-09T20:53:43Z

Test Results

19 files 19 suites 3d 17h 10m 42s ⏱️
2 745 tests 2 745 ✅ 0 💤 0 ❌
50 721 runs 50 721 ✅ 0 💤 0 ❌

Results for commit f312f3b.

♻️ This comment has been updated with latest results.

silverweed

Thank you for tackling this long-standing issue!
I left a small comment.

io/io/test/TFileMergerTests.cxx

copy paste typo

silverweed

LGTM but I'd wait for @pcanal 's approval as well

pcanal · 2025-05-12T19:58:28Z

io/io/src/TFileMerger.cxx

@@ -542,7 +542,8 @@ Bool_t TFileMerger::MergeOne(TDirectory *target, TList *sourcelist, Int_t type,
            keyname, keytitle);
      return kTRUE;
   }
-   Bool_t canBeFound = (type & kIncremental) && (current_sourcedir->GetList()->FindObject(keyname) != nullptr);
+   Bool_t canBeFound = (type & kIncremental) && (current_sourcedir->GetList()->FindObject(keyname) != nullptr) &&


Why is it && and not || ?

Or more exactly, I don't understand yet why the fact the histogram can be found means that it is not written in the end ...

Check out:

https://github.com/ferdymercury/root/blob/1531153ea11a7b54f4eb3c170bbd28e9bc46447f/io/io/src/TFileMerger.cxx#L808-L817

so if canBeFound is true, there is an optimization that spare some write cycles.

We use && to avoid it being true, ie to force writing to file. Using || would go in the different direction.

Do you want me to rename canBeFound to skipPartialWriting ?

Do you want me to rename canBeFound to skipPartialWriting ?

Not yet. I am still confused.

The original canBeFound meant (in the context of incremental merge) 'histogram can be found in the source' while the new version is histogram can be found in the source and in the target.

The optimization is indeed 'skip partial writing if we can found the histogram again'.

So I don't understand (yet) the semantic of the change. ie. Why is the new criteria the right choice? Is the new criteria instead 'just' making canBeFound always false?

Another avenue of inquiry is 'the original code assume that if canBeFound is true then there will be another change to write the histogram. Why is it no true anymore (it does not seem to be realted to 'can not be found in target')? Is there other variation of the example that also fails (how does the -n X value relates to the number of files in the input list and how many are 'missing' the histograms).

Related: is it possible that the alternative if that at the refresh boundary there needs to be a flush/write as if we were at the end?

Thanks for the review. I am not sure about these questions; I just followed jblomer's suggestion. My (limited) understanding is that this change just forces an extra partial write the first time that a new histogram appear in any file. So it does not really harm, but is suboptimal since, if all files have exactly the same histograms, then we could have waited until the end. But it makes it work if there are some files with and some without, independently of the chosen N.

Yes but when going through the first file, it would not be (yet) in both the input and output, so it would be (spuriously) written, or am I still missing something?

Hmm well, this is the result I get in debug mode:

Info in TFileMerger::MergeOne: Writing partial result of h1 into target Info in TFileMerger::MergeOne: Writing partial result of h3 into target Info in TFileMerger::MergeOne: Writing partial result of h2 into target

So... are you referring to the second line Writing partial result of h3 into target when you say spurious write?

I think so. Since h3 is in every file, shouldn't it not need any partial write?

Behaviour prior to this PR:

Info in <TFileMerger::MergeOne>: Writing partial result of h1 into target Info in <TFileMerger::MergeOne>: Writing partial result of h3 into target

This happens because for the first file, (type & kIncremental) is false.

Behavior after this PR:

Info in <TFileMerger::MergeOne>: Writing partial result of h1 into target Info in <TFileMerger::MergeOne>: Writing partial result of h3 into target Info in <TFileMerger::MergeOne>: Writing partial result of h2 into target

So, this PR is only improving things, right?

If I understand you correctly, on top of this improvement, you would like to additionally see h3 not partially written at all:

Info in <TFileMerger::MergeOne>: Writing partial result of h1 into target Info in <TFileMerger::MergeOne>: Writing partial result of h2 into target

to be even more performant, but what you suggest would require to traverse all the files in advance to know which key should be not partially written at all if it's in all files. I think that would be less efficient than one extra partial write in the first occurrence of a key throughout the list of files; right?

Ok, let's do a more elaborate example:

auto filename0 = "f0_9022.root"; auto filename1 = "f1_9022.root"; auto filename2 = "f2_9022.root"; auto outname = "file9022mergeroutput.root"; TFile f0(filename0, "RECREATE"); TH1F h0("h0", "h0", 1, 0, 1); h0.Write(); f0.Close(); TFile f1(filename1, "RECREATE"); TH1F h("h1", "h1", 1, 0, 1); h.Write(); TH1F h3("h3", "h3", 1, 0, 1); h3.Write(); f1.Close(); TFile f2(filename2, "RECREATE"); TH1F h2("h2", "h2", 1, 0, 1); h2.Write(); h3.Write(); f2.Close(); TFileMerger filemerger{false, false}; filemerger.SetMaxOpenedFiles(2); filemerger.OutputFile(std::unique_ptr<TFile>{TFile::Open(outname, "RECREATE")}); filemerger.AddFile(filename0); filemerger.AddFile(filename1); filemerger.AddFile(filename2); gDebug = 1; filemerger.Merge();

So before the PR we get:

Info in <TFileMerger::MergeOne>: Writing partial result of h0 into target

(because for first type, type&incremental is false).

And the output file only contains histogram h0.

After the PR:

Info in <TFileMerger::MergeOne>: Writing partial result of h0 into target Info in <TFileMerger::MergeOne>: Writing partial result of h1 into target Info in <TFileMerger::MergeOne>: Writing partial result of h3 into target Info in <TH1Merger::ExamineHistogram>: Examine histogram h3 - labels 1 - same limits 1 - axis found 1 Info in <TH1Merger::ExamineHistogram>: Examine histogram h3 - labels 1 - same limits 1 - axis found 1 Info in <TH1Merger::LabelMerge>: Merging histogram h3 into h3 Info in <TFileMerger::MergeOne>: Writing partial result of h2 into target

And the output file contains all histograms.

So the question is: why do you think it is a problem to have a first write whenever it appears? It's the only way to make it work so that the next appearance can call Merge. It's true that with a higher value of SetMaxOpenedFiles and with all files containing exactly the same hists, it's less optimum with this change, but still, this version is safer I guess?...

to see how often the partial result is written, as pcanal suggested

io/io/src/TFileMerger.cxx

ferdymercury added 2 commits May 9, 2025 18:46

[io] don't miss writing a histogram that is only in a last file with …

5111f69

…option -n 2 Fixes root-project#9022

[test][io] add merge test for single file hist with -n 2

cb44a2f

ferdymercury requested review from jblomer and silverweed May 9, 2025 16:59

ferdymercury marked this pull request as ready for review May 9, 2025 17:02

ferdymercury requested a review from pcanal as a code owner May 9, 2025 17:02

dpiparo assigned silverweed May 10, 2025

silverweed reviewed May 12, 2025

View reviewed changes

io/io/test/TFileMergerTests.cxx Outdated Show resolved Hide resolved

[io][test] remove stray comment

6a1e569

copy paste typo

ferdymercury requested a review from silverweed May 12, 2025 09:37

silverweed approved these changes May 12, 2025

View reviewed changes

pcanal reviewed May 12, 2025

View reviewed changes

[io][nfc] add a print for debugging/optimization purposes

e5d0915

to see how often the partial result is written, as pcanal suggested

ferdymercury commented May 14, 2025

View reviewed changes

io/io/src/TFileMerger.cxx Outdated Show resolved Hide resolved

[nfc] debug1

f312f3b

ferdymercury added this to the 6.38.00 milestone May 15, 2025

ferdymercury requested a review from pcanal June 6, 2025 16:40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[io] don't miss writing a histogram that is only in a last file with option -n 2 #18679

[io] don't miss writing a histogram that is only in a last file with option -n 2 #18679

Uh oh!

ferdymercury commented May 9, 2025 •

edited

Loading

Uh oh!

github-actions bot commented May 9, 2025 •

edited

Loading

Uh oh!

silverweed left a comment

Uh oh!

Uh oh!

silverweed left a comment

Uh oh!

pcanal May 12, 2025

Uh oh!

pcanal May 12, 2025

Uh oh!

ferdymercury May 12, 2025

Uh oh!

pcanal May 13, 2025

Uh oh!

ferdymercury May 13, 2025

Uh oh!

pcanal Jun 6, 2025

Uh oh!

ferdymercury Jun 6, 2025 •

edited

Loading

Uh oh!

pcanal Jun 6, 2025

Uh oh!

ferdymercury Jun 6, 2025 •

edited

Loading

Uh oh!

ferdymercury Jun 6, 2025

Uh oh!

Uh oh!

Uh oh!

[io] don't miss writing a histogram that is only in a last file with option -n 2 #18679

Are you sure you want to change the base?

[io] don't miss writing a histogram that is only in a last file with option -n 2 #18679

Uh oh!

Conversation

ferdymercury commented May 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

This Pull request:

Changes or fixes:

Checklist:

Uh oh!

github-actions bot commented May 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Test Results

Uh oh!

silverweed left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

silverweed left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ferdymercury Jun 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ferdymercury Jun 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

ferdymercury commented May 9, 2025 •

edited

Loading

github-actions bot commented May 9, 2025 •

edited

Loading

ferdymercury Jun 6, 2025 •

edited

Loading

ferdymercury Jun 6, 2025 •

edited

Loading