Flaky Integration Test(s) #3366

reedsa · 2024-04-26T16:34:27Z

What happened?

Every now and then we get failures in CI from tests that appear to flake. Investigate and fix the flakiness.

Running with pytest-flakefinder shows exactly where things break down. I found that running all tests with in TestGoEthereumAsyncEthModuleTest with the flakefinder, tests/integration/go_ethereum/test_goethereum_ws/test_async_ctx_manager_w3.py::TestGoEthereumAsyncEthModuleTest::test_eth_modify_transaction passes the first 30ish times or so but then hangs for a bit and starts failing.

It appears that the request ID is not being matched with the cached response properly, if the response even exists. The test itself modifies the transaction but I'm not sure if it would effectively overwrite the original transaction in the cache. Perhaps this is why the IDs get messed up or the queue is not the expected size. Still digging.

After further investigation, it seems that slowing down the test with sleep or even using wait_for_transaction_receipt will cause the failure to happen on the 14th run.

Added @flaky_geth_dev_mining but it isn't working either.

Code that produced the error

pytest tests/integration/go_ethereum/test_goethereum_ws/test_async_ctx_manager_w3.py::TestGoEthereumAsyncEthModuleTest

Full error output

No response

Fill this section in if you know how this could or should be fixed

Will need to investigate. Could be some context somewhere that's getting reused, like an object that should be copied instead of assigned.

web3 Version

No response

Python Version

No response

Operating System

No response

Output from `pip freeze`

No response

The text was updated successfully, but these errors were encountered:

reedsa · 2024-04-26T20:44:44Z

We discussed changing the websocket provider so that replace_transaction handles the transaction properly. The original request should be removed from the cache so that there is no expectation for that response to come back.

fselmo · 2024-07-25T22:39:58Z

It doesn't seem like the modify transaction test is failing anymore, at least on a consistent way. We should probably try to observe this for a while and close this out if that's the case. #3440 Addressed some very flaky tests at least so our suite should be in a decent place compared to when we first rolled out the geth --dev refactor.

pacrob added priority: p3 normal labels May 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Flaky Integration Test(s) #3366

Flaky Integration Test(s) #3366

reedsa commented Apr 26, 2024 •

edited

Loading

reedsa commented Apr 26, 2024

fselmo commented Jul 25, 2024

Flaky Integration Test(s) #3366

Flaky Integration Test(s) #3366

Comments

reedsa commented Apr 26, 2024 • edited Loading

What happened?

Code that produced the error

Full error output

Fill this section in if you know how this could or should be fixed

web3 Version

Python Version

Operating System

Output from pip freeze

reedsa commented Apr 26, 2024

fselmo commented Jul 25, 2024

reedsa commented Apr 26, 2024 •

edited

Loading

Output from `pip freeze`