I've been working on a revised version of datafusion-ray for the past month or so; in fact, I submitted a PR today.
It replaces the disk-based shuffle with an Arrow Flight streaming shuffle and can scale beyond one node.
It should be at a point where it builds and works for you. You can read the PR here
@robtandy Thanks for the response. Since datafusion-ray is not published to pypi.org, I am finding it difficult to install it in a Python project and experiment with it. Without a datafusion-ray package on PyPI, building it from source and packaging it as a dependency of another project becomes too complex.
Hi
We are trying to use datafusion-ray. To get started, we tried to `pip install datafusion-ray`. However, `datafusion-ray` is not published. I tried installing directly from GitHub like this:
The failure reason is not displayed even in `--verbose` mode.
If someone wants to use DataFusion with Ray, should we continue using raysql until then?
I would like to contribute here if you are open for it.