Giter VIP home page Giter VIP logo

iwara_crawler's People

Contributors

qionglu735 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar

Forkers

z1137254268

iwara_crawler's Issues

Coding not set (possible environment differences), and syntax errors

Unfortunately, on attempt to run on Raspbian, using original file iwara_crawler_v2.py (prior editing to add my sources), i get following:

File "/home/pi/Scripts/iwara_crawler_v2.py", line 33
SyntaxError: Non-ASCII character '\xe5' in file /home/pi/Scripts/iwara_crawler_v2.py on line 33, but no encoding declared; see http://python.org/dev/peps/pep-0263/ for details

After searching for solution i add this to beginning of file as per: Python-no-encoding-declared
# -*- coding: utf-8 -*-

After adding this, I get another error:

  File "/home/pi/Scripts/iwara_crawler_v2.py", line 79
    def get(self, url, params=None, **_kwargs) -> requests.Response:
                                               ^
SyntaxError: invalid syntax

I'm not able to understand why this particular dash is a syntax violation.
It seems whole -> part is not recognized as "reverse annotation"?
Studying PEP3107, didn't quite figured what needed to be changed for now...

Platform:

PRETTY_NAME="Raspbian GNU/Linux 10 (buster)"
Python 2.7

Index out of range for multiple sources

Recently there's more and more authors which can be listed (list retrieved), but on download attempt, following occurred:

pi@Pi4-Secondary:~ $ python ~/Scripts/iwara_crawler.py
腿 玩 年 https://ecchi.iwara.tv/users/%E8%85%BF%20%E7%8E%A9%20%E5%B9%B4/videos
Reading Page No.1 ...
Video List:
--------------------------------------------------------------------------------
Traceback (most recent call last):
  File "/home/pi/Scripts/iwara_crawler.py", line 327, in <module>
    main(user["user_name"], user["file_prefix"], user["download_index"])
  File "/home/pi/Scripts/iwara_crawler.py", line 166, in main
    download_list.append(video_list[index])
IndexError: list index out of range

Running on Raspbian 10 Buster, on Raspberry 4B, headless, over SSH session.
Issue was occurring with one or few authors per month for last year (around the time I'm suing this project, my gratitude for sharing).
For last 2 weeks, basically every source I add shows this error.
Re-running prerequisites installation changed nothing.
Not using proxy (proxy IP commented with #), as my home setup doesn't have one, no other changes made, besides other authors added
Tested and experiencing it with following sources listed (as example, including original script sources, and my added sources):

    {"user_name": "vostman", "file_prefix": "", "download_index": [-1]},
    {"user_name": "The_Snowing-Fire", "file_prefix": "", "download_index": [-1, -2]},
    {"user_name": "mimimizu", "file_prefix": "", "download_index": [-1, -2]},
    {"user_name": "burton3rd", "file_prefix": "", "download_index": [-1, -2, -3]},
    {"user_name": "じさくのう", "file_prefix": "", "download_index": [-1, -2, -3]},
    {"user_name": "淫集院 笹", "file_prefix": "", "download_index": [-1, -2, -3]},
    {"user_name": "ViciNeko", "file_prefix": "", "download_index": [-1, -2]},
    {"user_name": "3Dimm Animations", "file_prefix": "", "download_index": [-1]},
    {"user_name": "sugokunemui", "file_prefix": "", "download_index": [-1]},
    {"user_name": "贾唯℡", "file_prefix": "AlZ", "download_index": [-1]},
    {"user_name": "HornOwl", "file_prefix": "", "download_index": [-1]},
    {"user_name": "kem_kem", "file_prefix": "", "download_index": [-1]},
    {"user_name": "lovemax", "file_prefix": "", "download_index": [-1]},
    {"user_name": "uAYx5C", "file_prefix": "", "download_index": []},
    {"user_name": "yaharuka", "file_prefix": "", "download_index": [-1, -2, -3]},
    {"user_name": "塩ひとつまみ", "file_prefix": "", "download_index": [-1]},

Leaving BLANK []} as following shows same error

    {"user_name": "3Dimm Animations", "file_prefix": "", "download_index": []},

Leaving positive value [1]} to download specific video, not last or pre-last shows same error, with traceback to different row, but same function.

pi@Pi4-Secondary:~ $ python ~/Scripts/iwara_crawler.py
3Dimm Animations https://ecchi.iwara.tv/users/3Dimm%20Animations/videos
Reading Page No.1 ...
Video List:
--------------------------------------------------------------------------------
Traceback (most recent call last):
  File "/home/pi/Scripts/iwara_crawler.py", line 327, in <module>
    main(user["user_name"], user["file_prefix"], user["download_index"])
  File "/home/pi/Scripts/iwara_crawler.py", line 164, in main
    download_list.append(video_list[index - 1])
IndexError: list index out of range

Unfortunately, my skills at Python are not sufficient to understand and fix it myself, I'm just a beginner.

For iwara_crawler_v2, selenium/fake-useragent aren't part of "requirements.txt"

Hello there.

Please remember following files aren't part of requirements, and cause script v2 execution to fail, unless installed manually.
Maybe worth mention in text, or add to requirements.txt

  • selenium (can be installed with sudo pip3 install selenium)
  • fake-useragent (can be installed with sudo pip3 install fake-useragent)

Example of my attempt to run script v2

[user]@Pi4-Primary:~/Scripts $ python3 ~/Scripts/iwara_crawler_v2_mod.py
Traceback (most recent call last):
  File "/home/pi/Scripts/iwara_crawler_v2_mod.py", line 2, in <module>
    from fake_useragent import UserAgent
ModuleNotFoundError: No module named 'fake_useragent'

After installation of fake-useragent execution gets through this step.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.