Python: fix load_chunk to temporary #913


Merged — 2 commits merged into openPMD:dev on Feb 22, 2021

Conversation

@ax3l (Member) commented Jan 29, 2021

Keep a reference to the array returned by load_chunk until the data is no longer in use.

Follow-up to #912 and related to #833
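The reference-keeping strategy the PR describes can be sketched in plain Python (illustrative only; `RecordComponentSketch`, `Buffer`, and `_pending` are hypothetical names, not the actual pybind11 binding code): the binding keeps its own strong reference to every array it hands out via `load_chunk` and only releases it at `flush`, when the backend has actually filled the buffer.

```python
import gc
import weakref


class Buffer:
    """Stand-in for the numpy array a load_chunk binding would return."""
    pass


class RecordComponentSketch:
    """Hypothetical sketch of the fix, not the real binding code."""

    def __init__(self):
        self._pending = []  # strong references keep buffers alive until flush

    def load_chunk(self):
        buf = Buffer()
        # Keep a reference ourselves, so the buffer survives even if the
        # caller discards the return value before the next flush.
        self._pending.append(buf)
        return buf

    def flush(self):
        # The (simulated) backend has consumed the data; drop the keepers.
        self._pending.clear()


r = RecordComponentSketch()
ref = weakref.ref(r.load_chunk())  # caller discards the return value
gc.collect()
assert ref() is not None  # still alive: the binding holds a reference
r.flush()
gc.collect()
assert ref() is None      # released only after flush
```

With a keeper like this in place, calling `load_chunk()` and ignoring the return value can no longer leave the backend pointing at freed memory.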

@ax3l ax3l added frontend: Python3 api: new additions to the API labels Jan 29, 2021
@ax3l ax3l force-pushed the topic-pyStoreChunkInMem branch 2 times, most recently from 4c7e3c0 to aab4d89 Compare January 29, 2021 07:06
@franzpoeschel (Contributor):

We might want to be careful not to do this twice ;) @ax3l
#901

I needed to implement something very similar for that PR

@ax3l (Member, Author) commented Jan 29, 2021

Yes, that's the follow-up PR to address the additional issue you mentioned and add an overload to allow in-memory reads:

Note: there is no in-memory overload for load_chunk yet exposed in Python.

I think it is still possible to trigger this bug in the load_chunk version. Notice that we create a new py::array on this line and pass it via shareRaw to the C++ API a few lines later. If the returned array is garbage-collected before the next flush, the next flush will write to free'd memory. That won't really happen in any sensible workflow, but we should still try to make it somewhat impossible to trigger memory errors from Python, and this one can be fixed much the same way that we already fixed the write-side API in this PR.
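The hazard described here can be mimicked in plain Python (a sketch; `Chunk` and `load_chunk_old` are hypothetical stand-ins, not real openPMD-api code): without a keeper reference on the binding side, a return value the caller discards dies before any flush could fill it.

```python
import gc
import weakref


class Chunk:
    """Stand-in for the py::array the old load_chunk binding returned."""
    pass


def load_chunk_old():
    # Old behavior (sketched): hand out a fresh buffer without keeping
    # any reference to it on the binding side.
    return Chunk()


ref = weakref.ref(load_chunk_old())  # caller discards the return value
gc.collect()
# In CPython the discarded buffer is collected immediately; a later
# flush() writing into it would touch freed memory.
assert ref() is None
```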

@ax3l ax3l mentioned this pull request Jan 29, 2021
@ax3l ax3l force-pushed the topic-pyStoreChunkInMem branch 3 times, most recently from 129d261 to 512d8c2 Compare January 29, 2021 21:13
@@ -1636,6 +1636,11 @@ def writeFromTemporaryStore(self, E_x):
data = np.array([[1, 2, 3]], dtype=np.dtype("int"))
E_x.store_chunk(data)

def loadToTemporaryStore(self, r_E_x):
# not catching the return value shall not result in a use-after-free:
r_E_x.load_chunk()
@ax3l (Member, Author) commented on the diff, Jan 29, 2021:

@franzpoeschel it looks like this test does not trigger the issue we were concerned about.

Maybe the GC zeroes out the data it frees or we are otherwise lucky... But then we should see this throw: https://github.com/openPMD/openPMD-api/blob/0.13.1/include/openPMD/RecordComponent.hpp#L334-L335

Do you have another Python snippet we could add in the test here that demonstrates the load_chunk issue?

@franzpoeschel (Contributor) replied:

I'd say this is likely a combination of (1) the rather small dataset that we test this on, (2) the fact that py::array initializes its data upon construction, and (3) that we don't allocate anything in between. So maybe we could trigger this by making things go a bit wilder.

@franzpoeschel (Contributor) replied:

https://github.com/franzpoeschel/openPMD-api/tree/topic-pyStoreChunkInMem-increaseBuffersize
This branch triggers the issue on my machine and your implementation fixes it.

@ax3l (Member, Author) replied:

Great hint, thank you - increasing the tested memory triggers the issue reliably :)

@ax3l ax3l changed the title [Draft] Python: in-memory version of load_chunk [Draft] Python: fix load_chunk to temporary Jan 29, 2021
@ax3l ax3l removed the api: new additions to the API label Jan 29, 2021
@ax3l ax3l changed the title [Draft] Python: fix load_chunk to temporary Python: fix load_chunk to temporary Jan 29, 2021
@ax3l (Member, Author) commented Jan 29, 2021

> We might want to be careful not to do this twice ;) @ax3l
> #901
>
> I needed to implement something very similar for that PR

@franzpoeschel sorry, there are too many LOC and feature additions in #901, so I did not notice the partial overlap 🙈

I split my PR into a bugfix (this PR) and a feature addition (#914) to keep them concise, backportable, and easy to review. Let's see how we combine them.

Will #901 get smaller once its dependent PR is merged and it can be rebased? :)

@ax3l ax3l force-pushed the topic-pyStoreChunkInMem branch from dc7444f to 4c9f9fb Compare January 29, 2021 21:43
@franzpoeschel (Contributor) replied:
> Is #901 getting smaller once its dependent PR is merged and it can be rebased? :)

A bit, but not drastically. Since this PR is the more "precise" one, I'd say go forward with implementing things here. I'll deal with getting things compatible during rebasing afterwards.

This should trigger a use-after-free.
@ax3l ax3l force-pushed the topic-pyStoreChunkInMem branch from 47a0200 to 4675b01 Compare February 22, 2021 07:44
Fix a use-after-free with load_chunk if the user discards the
returned object before flush.
@ax3l ax3l merged commit e743c03 into openPMD:dev Feb 22, 2021
@ax3l ax3l deleted the topic-pyStoreChunkInMem branch February 22, 2021 08:30
ax3l added a commit to ax3l/openPMD-api that referenced this pull request Apr 9, 2021
* Python Test: Discard loaded chunk

This should trigger a use-after-free.

* Fix Python: discarded load_chunk Object

Fix a use-after-free with load_chunk if the user discards the
returned object before flush.
2 participants