Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

_kendall_p_exact error when really large datasets are run #13

Open
rmflight opened this issue Apr 30, 2024 · 3 comments
Open

_kendall_p_exact error when really large datasets are run #13

rmflight opened this issue Apr 30, 2024 · 3 comments

Comments

@rmflight
Copy link
Member

_kendall_p_exact is no longer at mstats_basic._kendall_p_exact, at least in scipy v 1.13.0

It seems to now be at mstats._mstats_basic._kendall_p_exact.

We probably need an import check whether the function can be reached, and maybe we should just make our own copy as part of the icikt codebase.

@rmflight
Copy link
Member Author

OK, I'm wrong. I'm not sure what is causing this, except that I get an error about not finding the function when I'm running the really large examples on KyRIC. Running smaller examples has no issues, and the error doesn't come up.

With the 45K case, I get this error:

AttributeError: `scipy.stats.mstats_basic` has no attribute `_kendall_p_exact`; furthermore, `
scipy.stats.mstats_basic` is deprecated and will be removed in SciPy 2.0.0.

However, running the exact same version of icikt on a small version of the data on KyRIC has no errors. I'm kind of flummoxed, honestly, as to why it fails with a really large dataset, but not with a much smaller one.

@rmflight rmflight changed the title scipy has moved _kendall_p_exact _kendall_p_exact error when really large datasets are run Apr 30, 2024
@hunter-moseley
Copy link
Member

hunter-moseley commented Apr 30, 2024 via email

@rmflight
Copy link
Member Author

I'm really not sure. It takes forever to run, so if there is a memory issue, it may be happening late in the game, or with later entries.

I can try tracking memory with gpu_tracker, or I'll write a little memory tracker script in R to log the memory usage every minute or so over the course of the calculation.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants