Skip to content

Metadata availability in dataframes #18873

Open
@acampove

Description

@acampove

Explain what you would like to see improved and how.

Hi,

I am trying to debug a problem, (maybe in RDataFrame...) and I have built a dataframe from a set of files using FromSpec. The problem seems to emerge for certain subsets of entries. Once I am in those entries, I would like to know more about the source of those entries, like the file name/path. Ideally I would do something like:

names = rdf.GetFileNames()

and this would tell me from what files this particular dataframe has data. The usecase is that the dataframe might be made of hundreds of files and we might have an issue in some sections, corresponding to certain files. The problem could be identified like:

rdf_bad = rdf_all.Filter('a == -999')

Does anything like this exist? I think it makes sense to think of implementing stuff like this to ease debugging.

Cheers.

ROOT version

NA

Installation method

NA

Operating system

NA

Additional context

No response

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions