Giter VIP home page Giter VIP logo

Comments (5)

vanpersie32 avatar vanpersie32 commented on May 24, 2024 1

problems solved in #24 (comment)

from transformers-tutorials.

vanpersie32 avatar vanpersie32 commented on May 24, 2024

same error when running this link

from transformers-tutorials.

ManuelFay avatar ManuelFay commented on May 24, 2024

It seems like the PR fixed the problem in most cases, but when the data is a list of lists (and not a np array), pyarrow is still incapable of dealing with it as is. A quick and ugly fix for the moment can be to cast the lists as a np array.

Ugly fix (waiting for a better option). Change datasets/arrow_writer.py as such:

  try:
      if isinstance(type, _ArrayXDExtensionType):
          self.data = np.array(self.data)     # Here is the line to add
          if isinstance(self.data, np.ndarray):
              storage = numpy_to_pyarrow_listarray(self.data, type=type.value_type)
          else:
              storage = pa.array(self.data, type.storage_dtype)
          out = pa.ExtensionArray.from_storage(type, storage)

from transformers-tutorials.

NielsRogge avatar NielsRogge commented on May 24, 2024

See #36

from transformers-tutorials.

NielsRogge avatar NielsRogge commented on May 24, 2024

The issue linked above should solve your problem. Therefore, closing this issue.

from transformers-tutorials.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.