Comments (3)
Can you clarify a bit, do you want to replace the Browser/HttpFetcher or add an additional one?
from skrape.it.
hey,
the httpFetcher is currently using jsoup to make http requests.
https://github.com/skrapeit/skrape.it/blob/master/src/main/kotlin/it/skrape/core/fetcher/HttpFetcher.kt
we want to use KoHttp instead because it ships with a bunch of nice features like async requests and use jsoup for the document parsing only.
one thing to notice:
currently it is possible to add validateTLSCertificates
and maxBodySize
parameters to the request. since I don't think this is possible with HoHttp this ones can be removed from Request if it is not possible to manage this behavior with KoHttp.
The other Request options should be working further on.
since KoHttp provides different functions to handle certain http-request verbs (e.g. httpGet()
, httpPost()
) and scrape{it}'s approach is to pass the http verb as an parameter (Enum) it looks like we will need a mapping from org.jsoup.Connection.Method (which is the Enum that is currently used for method
field in Request.kt) or even better providing an Enum our own and get rid of the jsoup one.
from skrape.it.
I just opened #64, let me know if it's what you're looking for
from skrape.it.
Related Issues (20)
- [QUESTION] cannot access website using BrowserFetcher HOT 2
- [QUESTION] Charset not applied and fetch question marks instead HOT 2
- No field INSTANCE of type Lorg/apache/http/message/BasicLineFormatter; in class Lorg/apache/http/message/BasicLineFormatter[BUG]
- [QUESTION] Retrieving parent's next sibling?
- [QUESTION] Unable to use HttpFetcher and/or AsyncFetcher with ktor
- [QUESTION] How to disable log in the console
- [FEATURE] Kotlin Multiplatform support HOT 11
- [BUG] KDoc should use Kotlin formatting, not Javadoc HOT 5
- [FEATURE] Support for native image (Spring Native/GraalVM) HOT 8
- Three vulnerabilities detected
- [BUG] Crash on Android when using R8 HOT 2
- [BUG] Spring Boot 3.0.0-M5 - new logback version included into Spring Boot causing error HOT 1
- [BUG] No static field INSTANCE of type Lorg/apache/http/message/BasicLineFormatter
- [QUESTION] Execution error on some android devices
- [QUESTION] Tell SkrapeIT to wait until each JavaScript loaded completely HOT 3
- [BUG] Error on Android: api data not showing in release version but works fine in the debug mode HOT 5
- [QUESTION] Getting "Connection refused: no further information" when calls skrape.it exctract methods from submodule HOT 1
- [BUG] mode = Mode.DOM seems to be not working anymore HOT 2
- [BUG] logback.xml in jar HOT 1
- [BUG] Docs typo for async fetcher library
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from skrape.it.