Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Facilitate access to S3 buckets #67

Closed
jfigui opened this issue Aug 27, 2024 · 10 comments
Closed

Facilitate access to S3 buckets #67

jfigui opened this issue Aug 27, 2024 · 10 comments

Comments

@jfigui
Copy link
Collaborator

jfigui commented Aug 27, 2024

Reminder that at some point we would like to be able to work directly with data stored in S3 buckets

@wolfidan wolfidan pinned this issue Sep 8, 2024
@jfigui
Copy link
Collaborator Author

jfigui commented Sep 30, 2024

Hi @wolfidan,

I have pushed a first version on commit b4187d8 in dev.

Please have a look at it and let me know what you think.

@jfigui
Copy link
Collaborator Author

jfigui commented Sep 30, 2024

Hi @wolfidan ,

I pushed several enhancements in dev.

You can now work with data stored in s3 buckets for data types "ODIM", "ODIMBIRDS", "CFRADIAL", "CFRADIAL2", "CF1", "NEXRADII", "GAMIC", "ODIMGRID", "KNMIH5GRID" and "SKYECHO".

I had to do a significant revamping of read_data_radar.py but I think that with what I have done we are on path to simplify our data reading functions.

Please have a close look to what I have done and let me know if you find any issues. It would be good to be sure that I did not break anything before pushing to the master and I had a limited time for testing.

@wolfidan
Copy link
Collaborator

Hi @jfigui ,

Looks great, thanks a lot for your work, I didn't get the time today, but I will do it tomorrow for sure,

@wolfidan
Copy link
Collaborator

wolfidan commented Oct 1, 2024

At the moment, the tests fail because compatibility is broken in https://github.com/MeteoSwiss/pyrad/blob/dev/src/pyrad_proc/pyrad/io/io_aux.py get_file_list
The issue is that the indexing cfg["datapath"][ind_rad] is not done correctly for some cases, for example at l.2984 we have datapath = (
f'{cfg["datapath"]}{dayinfo}/{basename}/')
since cfg["datapath"] is a list this ends up in a wrong file pattern.

Another issue is that you try to use cfg["DataTypeIDInFiles"] in the nexrad reader. I don't think this works. I would suggest removing it.

@wolfidan
Copy link
Collaborator

wolfidan commented Oct 2, 2024

@jfigui
Copy link
Collaborator Author

jfigui commented Oct 2, 2024

Thanks for the feedback.

I corrected some bugs and it passes the ubuntu tests now but not the windows one. It fails when setting up miniconda. I do not see how this can be related to the changes in the code. Can you have a look?

@wolfidan
Copy link
Collaborator

wolfidan commented Oct 3, 2024

Hi @jfigui , thanks, sorry I am taking some vacation, but I will try to look into it asap

@wolfidan
Copy link
Collaborator

wolfidan commented Oct 3, 2024

Managed to make the tests run

@jfigui
Copy link
Collaborator Author

jfigui commented Oct 4, 2024

Great! Thanks a lot! Let's keep this issue open until we can use s3 buckets with any file.

@wolfidan
Copy link
Collaborator

Closing as taken care of by 4f6d353 and (4f6d353)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants