Comments (5)
Hi @raphaelzhou1, news-crawl is built with Java 8. Sorry, this information wasn't given in the README (updated now) but only in the pom.xml. Please, install JDK 1.8, make JAVA_HOME point to it and try again. I'm also not 100% sure whether Apache Storm 1.2.3 or 1.2.4 runs on ARM. However, I expect that development and testing should be possible.
from news-crawl.
Our typical news crawl user just downloads the WARCs we generate. We don't test this software outside of our environment.
However, if you want some clues from someone who also is not that familiar with Java, in the screenshots there's a clue near the end of all of that confusing output:
Please refer to /Users/user/Desktop/Coding/AI/Context/financial_news_extractor/news-crawl/target/surefire-reports for individual test results
from news-crawl.
from news-crawl.
@raphaelzhou1 Did Java 8 work for you? Can we close this issue?
from news-crawl.
from news-crawl.
Related Issues (20)
- Allow to follow news sites not providing RSS/Atom feed or news sitemap HOT 2
- Do not use "http/2" protocol version in HTTP headers in WARC files HOT 1
- Error in build docker HOT 3
- Odd duplicate content behaviour on www.diariodeavila.es domain HOT 4
- How to get a listing of WARC/WAT/WET files using HTTP for News Dataset ? HOT 2
- News archive is not available since 06.06.2021 HOT 3
- Run docker in a non-interactively way HOT 1
- How large is the dataset HOT 2
- Use wikidata to complete seeds HOT 1
- Explore schema.org annotations for seed completions
- Consider archiving of news feeds and sitemaps
- produce WET files? HOT 6
- News archive is not available since 2023-10-23 15:36:50 HOT 1
- Avoid following advertisements in news feeds and sitemaps
- Nutch-compatible implementation of FastURLFilter + use it in PreFilterBolt
- Port topology and resources to StormCrawler 2.10 HOT 2
- news-crawl 2.x Broken when using multiple workers (across multiple hosts) HOT 17
- Have as many WARCBolt instances as there are workers
- Route tuples to the status updater bolt based on URLs
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from news-crawl.