Giter VIP home page Giter VIP logo

Comments (4)

sklam avatar sklam commented on May 15, 2024

pygdf still requires data to fit in GPU memory. Our out-of-core solution will be provided by https://github.com/gpuopenanalytics/dask_gdf, which uses pygdf.DataFrame as a building block with an interface similar to dask.dataframe. However, dask_gdf is in earlier stages than pygdf and the docs are a bit lacking at the moment.

Pygdf has a delayed memory deallocation machinery inherited from numba. When a DataFrame is no longer retained by any reference, it goes into a pool for pending deallocation. The memory is deallocated as soon as there are 10 pending deallocation, pending deallocation holds on 20% of total gpu memory, or the numba gpu allocator observes a CUDA_ERROR_OUT_OF_MEMORY.

from cudf.

Aylwin7 avatar Aylwin7 commented on May 15, 2024

Thank you for your answer, so is pygdf not suitable for big data analisis? should I use dask_gdf for big data analisis? How does dask_gdf work?

from cudf.

sklam avatar sklam commented on May 15, 2024

Since big-data analytics is a big umbrella term, I will use the term out-of-core for your usecase where the data cannot fit in ram. We designed pygdf and dask_gdf such that pygdf handles in-core operations and dask_gdf handles out-of-core operations. Recall that dask_gdf uses pygdf as the building block, so it is possible to put pygdf in another out-of-core execution framework (i.e Spark).

For your usecase, you might be more interested in dask_gdf. To learn about dask_gdf, it good to start with learning about dask and the dask.dataframe at http://dask.pydata.org/. The dask_gdf is just dask.dataframe that uses pygdf.DataFrame as the building block instead of the pandas.DataFrame. We currently don't have a documentation page for dask_gdf but we are working on it.

Lastly, pygdf and dask_gdf are both in alpha stage. We encourage early adopters to experiment with them and give us feedback. But, it is not ready for production deployment yet.

from cudf.

Aylwin7 avatar Aylwin7 commented on May 15, 2024

thank you for your answer. I will try it. looking foward to see the final stage of this project

from cudf.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.