rfc: add initial draft of pcap-cf #1309

maxmoehl · 2025-09-10T06:18:26Z

🚀 link for easy viewing

Co-Authored-By: Claude <[email protected]>

peanball

A few comments and points of discussion.

General:

I'm missing the "opt-in" nature and control of the feature and discussion of possible security risks when this feature is enabled overall. The "attacker" in this case is a rouge operator and that might be more of an organizational problem. But the app's operator shouldn't be able to override an org policy, etc.
Because this is an analog to bosh pcap, maybe we can explain the functionality and one of its main benefits (multiplexing of traffic captured across multiple instances) a bit earlier? Right now it's in the very last sentence of the very last section (before the references). If it's important it should be repeated.

toc/rfc/rfc-draft-pcap-cf.md

peanball · 2025-09-10T07:44:43Z

toc/rfc/rfc-draft-pcap-cf.md

+The challenge is providing this functionality while maintaining the security
+model of Cloud Foundry, where applications run in isolated, unprivileged
+containers.


Can we highlight here that this could be achieved by not generally elevating privileges but elevating privileges for clearly defined, narrow use cases?

Or rather: "Elevated privileges can easily be misused and it is paramount (or some less fancy word) that the security model of CF remains intact, while giving app developers and operators the choice and tools to be able to asses network traffic."

This also alludes to this feature being opt-in. You might not want to enable it permanently, and you may not want to enable it for some specific apps at all.

Maybe something like this?

Suggested change

The challenge is providing this functionality while maintaining the security

model of Cloud Foundry, where applications run in isolated, unprivileged

containers.

While platform traffic can be secured from eavesdropping, e.g. via mTLS, access to

the network traffic in a container will contain sensitive and possibly private

information.

Elevated privileges of any kind can easily be misused and it is paramount that the

security model of Cloud Foundry remains intact, where applications usually run in

isolated, unprivileged containers.

Any solution must consider that network capture is a privilege that has to be

enabled explicitly, not given by default and can be forbidden altogether.

toc/rfc/rfc-draft-pcap-cf.md

maxmoehl · 2025-09-10T08:53:24Z

Maybe a general note on the opt-in: there were no extensive discussions on what that "opt-in" would look like. In my opinion it should just be the existing SSH access and not a dedicated flag. Mainly because having SSH access already puts you in a position of being able to do a lot and we have this setup as a feature flag on the different levels of CF.

peanball · 2025-09-10T12:55:02Z

toc/rfc/rfc-draft-pcap-cf.md

+the various lifecycle archives that are added to the final app container and
+the necessary capabilities (`CAP_NET_RAW` and `CAP_NET_ADMIN`) will be assigned
+to the executable via file capabilities. This allows regular users to gain those
+capabilities when executing the binary.


Suggested change

capabilities when executing the binary.

capabilities when executing the binary.

The functionality is limited to network capturing with the aforementioned scope

of selected network interface, filtering expression in pcap-filter format and

length of captured packets (snaplen). This reduces the attack surface, compared

to invoking a full `tcpdump`.

(if it's important, repeat it ;-) )

peanball · 2025-09-10T12:57:41Z

toc/rfc/rfc-draft-pcap-cf.md

+Similar to the `bosh pcap` command a `cf pcap` command will be added. Like its
+predecessor it will connect to the desired instances via SSH and execute the new
+packet capturing tool and stream back the captured packets via stdout. If there
+are multiple streams, the CLI will merge them and write them out to a single
+file in the pcap format.


Suggested change

Similar to the `bosh pcap` command a `cf pcap` command will be added. Like its

predecessor it will connect to the desired instances via SSH and execute the new

packet capturing tool and stream back the captured packets via stdout. If there

are multiple streams, the CLI will merge them and write them out to a single

file in the pcap format.

Similar to the `bosh pcap` command, a `cf pcap` command will be added. Like its

counterpart, it will connect to the desired instances via SSH, execute the new

packet capturing tool and stream back the captured packets via stdout and thus via SSH.

If there are multiple streams, the CLI will merge them and write them out to a single

file in the pcap format.

"predecessor" makes it look like bosh pcap might be going away.

peanball · 2025-09-10T13:38:40Z

Maybe a general note on the opt-in: there were no extensive discussions on what that "opt-in" would look like. In my opinion it should just be the existing SSH access and not a dedicated flag. Mainly because having SSH access already puts you in a position of being able to do a lot and we have this setup as a feature flag on the different levels of CF.

I see your point. That said, capturing network traffic might show data that is not even stored plain-text in a database that you could gain access to via SSH. So maybe something worth discussing more extensively.

For future discussions and votes: My opinion is that network capture is a privilege beyond SSH.

ameowlia

Would a user be able to see unencrypted traffic contents?
What roles would be able to use this comment for a given app?
Would this feature always be on, or would there be a foundation wide flag?
Could there be a cf event to log when this action is taken?

toc/rfc/rfc-draft-pcap-cf.md

ameowlia · 2025-09-10T20:10:08Z

toc/rfc/rfc-draft-pcap-cf.md

+```bash
+# Capture HTTP traffic for myapp
+cf pcap myapp --interface eth0 --filter "tcp port 80" --snaplen 1500
+
+# Capture specific instance with custom filter
+cf pcap myapp --instance 1 --filter "host database.example.com"
+```
+


Can you give some examples of what the output would look like?

Done, please check again.

peanball · 2025-09-10T21:00:20Z

Some replies to Amelia's questions. @maxmoehl, please correct me if I'm wrong.

Would a user be able to see unencrypted traffic contents?

If they capture on the interface lo, they would.

What roles would be able to use this comment for a given app?

This needs to be fleshed out in the RFC a bit. I think it depends on the scenario. In a test org or space a group of developers may want to capture stuff, in a production org maybe only select people after prior approval.

Would this feature always be on, or would there be a foundation wide flag?

Foundation wide general flag, and ideally additional per org/space/app(?) flag.

Could there be a cf event to log when this action is taken?

That is a good idea, for traceability purposes.

maxmoehl · 2025-09-16T06:26:08Z

What roles would be able to use this comment for a given app?

This needs to be fleshed out in the RFC a bit. I think it depends on the scenario. In a test org or space a group of developers may want to capture stuff, in a production org maybe only select people after prior approval.

My current proposal (though I should maybe clarify this a bit more) does not introduce any additional permissions. As long as a user can SSH into the app they will be able to initiate a capture. That includes capturing plain-text traffic.

Would this feature always be on, or would there be a foundation wide flag?

Foundation wide general flag, and ideally additional per org/space/app(?) flag.

This is where it gets tricky. Foundational flag we can somehow make work via diego-release, it would need to control whether the binary is injected or not which makes this complicated as the injected binaries right now come via a BOSH package which is not configurable in any way. I would prefer not to add org/space/app flags beyond the SSH one¹.

Could there be a cf event to log when this action is taken?

That is a good idea, for traceability purposes.

This comes down to the feature just being SSH with a special binary. There won't be any interaction with the CF API beyond SSH, so no special audit log will be written.

I will also spend some time today addressing the remaining comments.

This also comes down to capacity. If we require a feature flag equivalent to the SSH one on every level I would need someone to help me make this work as it is quite a lot more work. ↩

rfc: add initial draft of pcap-cf

902e3e3

Co-Authored-By: Claude <[email protected]>

maxmoehl mentioned this pull request Sep 10, 2025

Enable app developers to perform privileged actions if the operator allows it cloudfoundry/diego-release#1023

Open

cf-foundation-community-automation bot added this to CF Community Sep 10, 2025

cf-foundation-community-automation bot moved this to Inbox in CF Community Sep 10, 2025

peanball reviewed Sep 10, 2025

View reviewed changes

first round of review feedback

3fa4a92

peanball reviewed Sep 10, 2025

View reviewed changes

ameowlia reviewed Sep 10, 2025

View reviewed changes

beyhan added toc rfc CFF community RFC labels Sep 12, 2025

second round of review feedback

463d354

maxmoehl marked this pull request as ready for review September 25, 2025 05:51

beyhan requested review from a team, rkoster, beyhan, Gerg, stephanme and cweibel and removed request for a team September 29, 2025 06:49

beyhan moved this from Inbox to In Progress in CF Community Sep 30, 2025

-The challenge is providing this functionality while maintaining the security
-model of Cloud Foundry, where applications run in isolated, unprivileged
-containers.
+While platform traffic can be secured from eavesdropping, e.g. via mTLS, access to
+the network traffic in a container will contain sensitive and possibly private
+information.
+Elevated privileges of any kind can easily be misused and it is paramount that the
+security model of Cloud Foundry remains intact, where applications usually run in
+isolated, unprivileged containers.
+Any solution must consider that network capture is a privilege that has to be
+enabled explicitly, not given by default and can be forbidden altogether.

-capabilities when executing the binary.
+capabilities when executing the binary.
+The functionality is limited to network capturing with the aforementioned scope
+of selected network interface, filtering expression in pcap-filter format and
+length of captured packets (snaplen). This reduces the attack surface, compared
+to invoking a full `tcpdump`.

rfc: add initial draft of pcap-cf #1309

Are you sure you want to change the base?

rfc: add initial draft of pcap-cf #1309

Conversation

maxmoehl commented Sep 10, 2025 • edited by ameowlia Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

peanball left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

peanball Sep 10, 2025

Choose a reason for hiding this comment

Uh oh!

peanball Sep 10, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

maxmoehl commented Sep 10, 2025

Uh oh!

peanball Sep 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

peanball Sep 10, 2025

Choose a reason for hiding this comment

Uh oh!

peanball commented Sep 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ameowlia left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

ameowlia Sep 10, 2025

Choose a reason for hiding this comment

Uh oh!

maxmoehl Sep 17, 2025

Choose a reason for hiding this comment

Uh oh!

peanball commented Sep 10, 2025

Uh oh!

maxmoehl commented Sep 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Footnotes

Uh oh!

Uh oh!

maxmoehl commented Sep 10, 2025 •

edited by ameowlia

Loading

peanball Sep 10, 2025 •

edited

Loading

peanball commented Sep 10, 2025 •

edited

Loading

maxmoehl commented Sep 16, 2025 •

edited

Loading