
Comments (5)

buriy commented on August 19, 2024

It looks like I removed support for passing lists to negative_keywords and positive_keywords; only a comma-separated string can be used now.
As a workaround for now, please pass them as comma-separated strings. I'll fix it in the next version.

from python-readability.

olivierthereaux commented on August 19, 2024

Might be worth changing the examples too.

From:

            positive_keywords=["news-item", "block"]
            negative_keywords=["mysidebar", "related", "ads"]

To:

            positive_keywords="news-item, block"
            negative_keywords=re.compile(r"mysidebar|related|ads")

https://github.com/buriy/python-readability/blob/master/readability/readability.py#L90


thom4parisot commented on August 19, 2024

Ah okay, that makes sense, thanks for the prompt feedback!

I do not mind changing the syntax, but I'd only do it if the change is long-term. Otherwise I'd rather wait for the fix to be released.

As @olivierthereaux said, it would be better to update the docs (README) and code comments if the change is long-term, because we spent a bit of time yesterday confused about why the syntax did not work as documented.


buriy commented on August 19, 2024

I think I will allow all three options (comma-separated string, list and regex) because it won't cost much: the conversion only needs to run once, on Document initialization.
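
For illustration, accepting all three forms could look roughly like the sketch below. `normalize_keywords` is a hypothetical helper, not the library's actual code, and the case-insensitive matching is an assumption. It turns a comma-separated string, a list, or a precompiled regex into a single compiled pattern, so the conversion happens once up front:

```python
import re


def normalize_keywords(keywords):
    """Hypothetical helper: return one compiled regex (or None) for a keyword spec.

    Accepts a comma-separated string, a list/tuple of strings,
    or an already compiled regular expression.
    """
    if not keywords:
        return None
    # A precompiled pattern is passed through unchanged.
    if isinstance(keywords, re.Pattern):
        return keywords
    # A comma-separated string is split into individual keywords.
    if isinstance(keywords, str):
        keywords = keywords.split(",")
    # Strip whitespace, drop empty entries, and escape regex metacharacters.
    parts = [re.escape(k.strip()) for k in keywords if k.strip()]
    if not parts:
        return None
    # Assumption: keyword matching is case-insensitive.
    return re.compile("|".join(parts), re.IGNORECASE)


positive = normalize_keywords("news-item, block")
negative = normalize_keywords(["mysidebar", "related", "ads"])
```

With something like this, all three spellings from the examples above would end up as a pattern that can be matched against an element's class or id.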


deanishe commented on August 19, 2024

👍 Whichever options are chosen, the docs should be right.

I couldn't figure out what the problem was because I was only looking at the docstring. As soon as I looked at the code, I realised the docstring is wrong: only regex or strings are allowed, not lists.

