(improvement) log a warning when importing of lz4 or snappy packages … #510


Open

mykaul wants to merge 1 commit into master from log_on_import_failure

Conversation


@mykaul mykaul commented Aug 6, 2025

…fails.

If the package is not available, the driver will silently not use compression. Well, not entirely silently: at debug level only, you will see something like: "No available compression types supported on both ends. locally supported: odict_keys([]). remotely supported: ['lz4', 'snappy']"

Make it a warning. I think it wouldn't be too noisy, and it is clear enough for the developer.
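
For context, a minimal sketch of the kind of guarded import plus warning this proposes — the variable names and exact messages here are illustrative, not the driver's actual code:

```python
import logging

log = logging.getLogger(__name__)

# Optional compression codecs: only registered if their packages import cleanly.
locally_supported_compressions = {}

try:
    import lz4.block  # optional dependency
    locally_supported_compressions['lz4'] = (lz4.block.compress, lz4.block.decompress)
except ImportError as exc:
    # Previously this failure was invisible outside the debug-level message of the
    # later negotiation step; the proposal is to surface it as a warning.
    log.warning("lz4 package not found; 'lz4' compression will not be available: %s", exc)

try:
    import snappy  # optional dependency (python-snappy)
    locally_supported_compressions['snappy'] = (snappy.compress, snappy.decompress)
except ImportError as exc:
    log.warning("snappy package not found; 'snappy' compression will not be available: %s", exc)
```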

Pre-review checklist

  • I have split my patch into logically separate commits.
  • All commit messages clearly explain what they change and why.
  • I added relevant tests for new features and bug fixes.
  • All commits compile, pass static checks and pass tests.
  • PR description sums up the changes and reasons why they should be introduced.
  • I have provided docstrings for the public items that I want to introduce.
  • I have adjusted the documentation in ./docs/source/.
  • I added appropriate Fixes: annotations to PR description.

@mykaul mykaul requested a review from Copilot August 6, 2025 14:32

@Copilot Copilot AI left a comment


Pull Request Overview

This PR improves user experience by adding warning logs when optional compression packages (lz4 and snappy) fail to import. Previously, missing compression packages would only show debug-level messages when compression negotiation failed, making it difficult for developers to understand why compression wasn't working.

  • Added warning log messages when lz4 package import fails
  • Added warning log messages when snappy package import fails

@Lorak-mmk

One more thing we could (and should!) do is to specify those dependencies - right now they are totally absent from pyproject.toml, which makes them hard to discover.
Those should be listed in the [project.optional-dependencies] section. We could either make it a single extra, like compression = ['lz4', 'snappy'], or an extra per library, like

    compression-snappy = ['snappy']
    compression-lz4 = ['lz4']

@mykaul
Author

mykaul commented Aug 6, 2025

One more thing we could (and should!) do is to specify those dependencies - right now they are totally absent from pyproject.toml, which makes them hard to discover. Those should be listed in the [project.optional-dependencies] section. We could either make it a single extra, like compression = ['lz4', 'snappy'], or an extra per library, like

    compression-snappy = ['snappy']
    compression-lz4 = ['lz4']

Yes, that's a different issue - it also means we never test compression.

@mykaul mykaul force-pushed the log_on_import_failure branch from 7079a15 to 7db823d on August 6, 2025 14:54
@mykaul
Author

mykaul commented Aug 7, 2025

macOS failure:

=================================== FAILURES ===================================
  __________________ StrategiesTest.test_nts_token_performance ___________________
  
  self = <tests.unit.test_metadata.StrategiesTest testMethod=test_nts_token_performance>
  
      def test_nts_token_performance(self):
          """
          Tests to ensure that when rf exceeds the number of nodes available, that we dont'
          needlessly iterate trying to construct tokens for nodes that don't exist.
      
          @since 3.7
          @jira_ticket PYTHON-379
          @expected_result timing with 1500 rf should be same/similar to 3rf if we have 3 nodes
      
          @test_category metadata
          """
      
          token_to_host_owner = {}
          ring = []
          dc1hostnum = 3
          current_token = 0
          vnodes_per_host = 500
          for i in range(dc1hostnum):
      
              host = Host('dc1.{0}'.format(i), SimpleConvictionPolicy)
              host.set_location_info('dc1', "rack1")
              for vnode_num in range(vnodes_per_host):
                  md5_token = MD5Token(current_token+vnode_num)
                  token_to_host_owner[md5_token] = host
                  ring.append(md5_token)
              current_token += 1000
      
          nts = NetworkTopologyStrategy({'dc1': 3})
          start_time = timeit.default_timer()
          nts.make_token_replica_map(token_to_host_owner, ring)
          elapsed_base = timeit.default_timer() - start_time
      
          nts = NetworkTopologyStrategy({'dc1': 1500})
          start_time = timeit.default_timer()
          nts.make_token_replica_map(token_to_host_owner, ring)
          elapsed_bad = timeit.default_timer() - start_time
          difference = elapsed_bad - elapsed_base
  >       assert difference < 1 and difference > -1
  E       assert (-1.130967961999886 < 1 and -1.130967961999886 > -1)

@mykaul
Author

mykaul commented Aug 7, 2025

Looks like a flaky test to me. I have no idea what the test does - measure that it's not more than a 1 s difference? Not sure who set it as such. Anyway... I can retry or whatnot.

mykaul added a commit to mykaul/python-driver that referenced this pull request Aug 7, 2025
While trying to look at some random (flaky?) test (scylladb#510 (comment))
I saw some (minor) improvements that can be made to make_token_replica_map():
1. Hoist some redundant len() calls out of the loop(s).
2. Align some variable names so they start with num_....
3. Move the token_offset and host assignments within the loop closer to where they are used.

All of these are probably very minor improvements; perhaps in a large cluster they'll be noticeable (a sketch of the len() hoisting in item 1 follows below).

Signed-off-by: Yaniv Kaul <[email protected]>
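
As an aside, a minimal sketch of the len() hoisting mentioned in item 1 above, using made-up variable and function names rather than the driver's actual ones:

```python
# Before: len(ring) is recomputed on every outer iteration even though the
# ring does not change inside the loop.
def count_tokens_before(ring, tokens):
    hits = 0
    for token in tokens:
        for i in range(len(ring)):   # redundant len() per outer iteration
            if ring[i] == token:
                hits += 1
    return hits

# After: compute the length once, outside the loops.
def count_tokens_after(ring, tokens):
    ring_len = len(ring)             # hoisted out of the loops
    hits = 0
    for token in tokens:
        for i in range(ring_len):
            if ring[i] == token:
                hits += 1
    return hits
```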
@dkropachev
Collaborator

dkropachev commented Aug 7, 2025

@mykaul, can you please also bump the log level to error for the case when the user picked compression but there is no compression library available:

    log.debug("No available compression types supported on both ends."
              " locally supported: %r. remotely supported: %r",
              locally_supported_compressions.keys(),
              remote_supported_compressions)

It probably makes sense to reduce the log level for import errors to debug; warning is too high IMHO.

mykaul added a commit to mykaul/python-driver that referenced this pull request Aug 8, 2025
While trying to look at some random (flaky?) test (scylladb#510 (comment))
I saw some (minor) improvements that can be made to make_token_replica_map():
1. Hoist some redundant len() calls out of the loop(s).
2. Align some variable names so they start with num_....
3. Move the token_offset and host assignments within the loop closer to where they are used.
4. Only add DCs and hosts that are in the map.

All of these are probably very minor improvements; perhaps in a large cluster they'll be noticeable.

Signed-off-by: Yaniv Kaul <[email protected]>
…fails.

If the package is not available, the driver will silently not use compression.
Well, not entirely silently: at debug level only, you will see something like:
"No available compression types supported on both ends. locally supported: odict_keys([]). remotely supported: ['lz4', 'snappy']"

Make this log line an error-level log.

Add a debug-level log to the import failure path. I think it wouldn't be too noisy, and it is clear enough for the developer.

Signed-off-by: Yaniv Kaul <[email protected]>
@mykaul mykaul force-pushed the log_on_import_failure branch from 7db823d to 6a5fab5 on August 8, 2025 06:33
@mykaul
Author

mykaul commented Aug 8, 2025

@mykaul, can you please also bump the log level to error for the case when the user picked compression but there is no compression library available:

    log.debug("No available compression types supported on both ends."
              " locally supported: %r. remotely supported: %r",
              locally_supported_compressions.keys(),
              remote_supported_compressions)

It probably makes sense to reduce the log level for import errors to debug; warning is too high IMHO.

Done.
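
For clarity, a rough sketch of what the levels look like after this change — this is illustrative rather than the driver's exact code, and _pick_compression is a made-up helper name:

```python
import logging

log = logging.getLogger(__name__)

# Import failures of the optional codecs are logged at debug level only.
try:
    import snappy  # optional dependency (python-snappy)
except ImportError as exc:
    log.debug("snappy package not found; 'snappy' compression will not be available: %s", exc)

# When the user asked for compression but no codec is supported on both ends,
# the message is promoted from debug to error so it is visible by default.
def _pick_compression(locally_supported_compressions, remote_supported_compressions):
    overlap = set(locally_supported_compressions) & set(remote_supported_compressions)
    if not overlap:
        log.error("No available compression types supported on both ends."
                  " locally supported: %r. remotely supported: %r",
                  locally_supported_compressions.keys(),
                  remote_supported_compressions)
        return None
    return overlap.pop()
```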
