Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GPU example #132

Closed
jbednar opened this issue Mar 31, 2016 · 10 comments
Closed

GPU example #132

jbednar opened this issue Mar 31, 2016 · 10 comments

Comments

@jbednar
Copy link
Member

jbednar commented Mar 31, 2016

Since Numba has good support for using GPUs, we should evaluate how well GPU performance maps onto Datashader's algorithms, and see if there are cases where we recommend using a GPU. Presumably it would be for datasets that fit entirely into memory on the GPU, and on which one needs to do multiple reaggregations; otherwise it seems like speedups will be hard to find due to the I/O time.

@ariejdl
Copy link

ariejdl commented Feb 23, 2018

These folks know about this! https://github.com/mapd/mapd-core Is the architecture comparable?

@jbednar
Copy link
Member Author

jbednar commented Feb 23, 2018

We've talked to them and compared notes, and their approach does share some similarities. But we'll need to move forward with our own GPU implementation before being able to compare in any detail. I'd expect to make use of https://github.com/gpuopenanalytics/pygdf , but don't currently have any funding to work on getting it working...

@ariejdl
Copy link

ariejdl commented Feb 23, 2018

Thanks, like this project, was using it today at work in a notebook. All the best.

@mrocklin
Copy link

Checking in here, is GPU acceleration still a topic of active interest within the Datashader community?

I was talking with @exactlyallan @AjayThorve about this and they seemed interested.

@jbednar
Copy link
Member Author

jbednar commented Apr 10, 2019

We're not only interested, we recently obtained funding for it, and have promised to deliver it sometime in the next 12 months. The agreed task is:

Support GPU DataFrames in Datashader

The new RAPIDS project developed by Nvidia and Anaconda provides a GPU-based DataFrame analogous to the Pandas and Dask DataFrames already supported by Datashader for rendering large datasets for tasks like regridding and resampling (including interpolation and aggregation). Once this support has been added to Datashader, users will be able to choose between Pandas dataframes (typically on a single node, possibly covering multiple CPU cores), Dask dataframes (which can support arbitrarily distributed computations across separate CPU nodes), or GPU dataframes (using data structures held entirely within the GPU memory. Each of these alternatives has different use cases and is optimized for different applications, and users will be able to choose whichever approach is most suitable for their applications.

@mrocklin
Copy link

My guess is that @exactlyallan and @AjayThorve might be good resources while pursuing this work. I'd encourage you all to chat at some point.

@kkraus14
Copy link

I'll also throw my name in the hat here as a maintainer of https://github.com/rapidsai/cudf that I'd be happy to chat if you have any questions or issues 😄

@jbednar
Copy link
Member Author

jbednar commented Apr 10, 2019

Sounds great! I'll send a meeting invite to all of you once we are ready to start.

@jbednar
Copy link
Member Author

jbednar commented Apr 12, 2019

We've looked over our various staff and project and schedules, and we currently plan to start this work in September of this year. It would be nice to start it sooner, but all the people involved at our end are fully booked until then. Could change, but if not, I'll be in touch in August!

@jbednar
Copy link
Member Author

jbednar commented Dec 12, 2019

Took a bit longer than expected, but GPU support is now in Datashader 0.9.0. See #793 .

@jbednar jbednar closed this as completed Dec 12, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants