Comments (4)

dosubot commented on June 29, 2024 2

To address the UnicodeDecodeError you're encountering when querying binary, non-UTF-8 encodable data from a VARBINARY column in Trino using SQL Lab in Superset, consider implementing a custom JSON serializer that can handle binary data by encoding it in a format like base64. This will allow binary data to be serialized into a string format that JSON can handle without errors.

Here are the steps you can take:

  1. Implement a Custom Serializer: Create a custom JSON serializer for handling binary data, ensuring it's encoded in a format such as base64 before serialization.

  2. Modify JSON Serialization Calls: Update the json.dumps calls in sqllab/api.py to utilize this custom serializer, particularly where JSON serialization of execution results and data serialization occurs.

  3. Adjust Data Decompression Logic: In utils/core.py, ensure the decompression logic can handle binary data correctly, which may involve type checks before decompression and appropriate handling for binary data.

  4. Thorough Testing: Test the querying of binary data from VARBINARY columns extensively to confirm the absence of the UnicodeDecodeError.

  5. Documentation: Update project documentation to reflect the handling of binary data in SQL Lab queries.

This approach should help you resolve the encoding issues you're facing with binary data in Superset's SQL Lab.
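As a rough illustration of step 1, here is a minimal sketch of a base64 fallback serializer. It uses the standard-library `json` module rather than Superset's actual serialization code, and the name `encode_binary` is hypothetical. Note that another comment in this thread reports that simplejson does not route bytes through the `default` hook, so this pattern may not carry over unchanged.

```python
import base64
import json


def encode_binary(value):
    """Fallback for json.dumps: turn bytes-like values into base64 text.

    Hypothetical helper for illustration; not part of Superset.
    """
    if isinstance(value, (bytes, bytearray, memoryview)):
        return base64.b64encode(bytes(value)).decode("ascii")
    # Anything else is still unsupported, so fail the same way json does.
    raise TypeError(f"Object of type {type(value).__name__} is not JSON serializable")


# A row shaped like a VARBINARY query result with non-UTF-8 bytes.
row = {"id": 1, "payload": b"\x00\x85"}
serialized = json.dumps(row, default=encode_binary)
print(serialized)
# {"id": 1, "payload": "AIU="}
```

A consumer can recover the original bytes with `base64.b64decode`, but it has to know which fields were encoded, since the output is an ordinary JSON string.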

from superset.

rusackas commented on June 29, 2024 1

I don't think so, but it's possible. I was trying to allow UTF-8 strings through without them being escaped. It's possible this has to do with something being escaped incorrectly, which may or may not be the same chunk o' code.

cccs-tom commented on June 29, 2024

+1
We're running into this as well. It also affects the Table viz (and possibly others), not just SQL Lab.
@rusackas Could this have anything to do with the stringify stuff you were trying to remove?

AutumnSun1996 commented on June 29, 2024

It seems that simplejson handles bytes itself (it tries to decode them as text) and never calls the 'default' function for them:

import simplejson


def convert_bytes(val):
    if isinstance(val, bytes):
        return 'converted-' + repr(val)
    return val


print(simplejson.__version__)
# 3.17.3

print(simplejson.dumps({'a': b'b'}, default=convert_bytes))
# {"a": "b"}

print(simplejson.dumps({'a': b'\x00\x85'}, default=convert_bytes))
# raises UnicodeDecodeError

So bytes should be handled before they go into the dumps function.
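One way to do that, sketched under assumptions: walk the result structure and replace every bytes value with base64 text before serializing. The helper name `sanitize_bytes` is hypothetical, and the standard-library `json` module stands in for simplejson here; once no bytes remain in the payload, the choice of serializer should not matter.

```python
import base64
import json


def sanitize_bytes(obj):
    """Recursively replace bytes values with base64-encoded strings.

    Hypothetical helper for illustration. Dict keys are left untouched,
    and tuples come back as lists, which is how JSON would emit them anyway.
    """
    if isinstance(obj, (bytes, bytearray)):
        return base64.b64encode(bytes(obj)).decode("ascii")
    if isinstance(obj, dict):
        return {key: sanitize_bytes(val) for key, val in obj.items()}
    if isinstance(obj, (list, tuple)):
        return [sanitize_bytes(item) for item in obj]
    return obj


payload = {"a": b"b", "rows": [(1, b"\x00\x85")]}
clean = sanitize_bytes(payload)
print(json.dumps(clean))
# {"a": "Yg==", "rows": [[1, "AIU="]]}
```

Encoding every bytes value, including ones that happen to be valid UTF-8 such as b'b', keeps the output predictable: a consumer never has to guess whether a given string is text or encoded binary.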
