Giter VIP home page Giter VIP logo

Comments (10)

ElPicador avatar ElPicador commented on May 25, 2024

Note to myself:
Add search on gns3: https://secure.helpscout.net/conversation/154616922/4890/?folderId=696715

from docsearch-scraper.

pixelastic avatar pixelastic commented on May 25, 2024

This looks like a pretty big enhancement, considering that the underlying engine we're using for scrapping (Scrapy) only do static HTML parsing. For SPA application, I would rather try to hit the API level if possible, or say that DocSearch is not compatible with their documentation.

For Prezly, they are using readme.io, maybe we could create something directly on readme.io level

from docsearch-scraper.

proudlygeek avatar proudlygeek commented on May 25, 2024

I'd say it's more a feature than an enhancement.

I would personally go with an optional HTTP Proxy which can process JavaScript (PhantomJS / Selenium) documentations and feed the resulting static page into Scrapy / Python. What do you think about this approach?

from docsearch-scraper.

ElPicador avatar ElPicador commented on May 25, 2024

There is Scrapy for JS: https://github.com/scrapinghub/scrapy-splash

from docsearch-scraper.

proudlygeek avatar proudlygeek commented on May 25, 2024

@ElPicador very cool! As I can see it's basically what I said, just more handy and already Dockerized 😄 did you already give it a try?

from docsearch-scraper.

ElPicador avatar ElPicador commented on May 25, 2024

Never, @redox was the one who told me about it

from docsearch-scraper.

redox avatar redox commented on May 25, 2024

@ElPicador @proudlygeek @pixelastic We've been thinking of making it the onboarding project of @aseure :)

from docsearch-scraper.

proudlygeek avatar proudlygeek commented on May 25, 2024

Awesomeness!!! 💯 👍

from docsearch-scraper.

aseure avatar aseure commented on May 25, 2024

I've opened a PR to address those problematic documentations. Please see #46.

from docsearch-scraper.

pixelastic avatar pixelastic commented on May 25, 2024

I think this can be closed

from docsearch-scraper.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.