Giter VIP home page Giter VIP logo

ripme's People

Contributors

0834dfbd2b avatar 4pr0n avatar blockmebob avatar blth avatar chikitulfo avatar cyian-1756 avatar drguildo avatar erayan avatar fgts avatar gaboso avatar gitter-badger avatar i-cant-git avatar jgibson02 avatar kas-luthor avatar l1am9111 avatar lacqui avatar lfaoanl avatar maheee avatar metaprime avatar midweekmayhem avatar propergeezer avatar rephormat avatar sdt0001 avatar starb0bby avatar sukhois avatar torbica avatar trilader avatar wiiplay123 avatar wraithfive avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

ripme's Issues

Deviantart: rips "preview" version of picture? (Sometimes but not always the original)

Examples:

http://yakovlev-vad.deviantart.com/art/strange-plant-412293638
Expands properly without having to touch the "Download" button, so RipMe downloads it without a hitch and in the original resolution.

http://csimadmax.deviantart.com/art/glitterface-344240007
This however, does not. Even though they are both .PNG's, and this picture is smaller in both vertical and horisontal resolution, Deviantart refuses a proper expansion into the original picture without clicking the "Download" button to the side. RipMe downloads the smaller "preview" instead of the original.

Edit: So it turns out this is a limitation that can be set in place by uploaders on a picture-by-picture basis. I suspect the fix required to make the downloads consistent would have to be that RipMe takes the picture from the link the "Download" button provides?

Edit2: My god, turns out they also have a limit on how much you can download before the downloadbutton stops working, this might be a lost cause.

Error when fetching update: Address family not supported by protocol family

Whenever I start up the program I'm hit immediately with this error:

Startup Error

After the closing the window the program proceeds to function normally. I can enter the url and change the settings. However once I click rip the same error comes up:

Ripping Error

I tried running it normally and with admin rights through the command prompt, both with the same result. What do you think could be the issue? Let me know if I can help in anyway. Thanks!

Tumblr tags get trimmed if they contain spaces

There's a bug with the tumblr ripper (at least) that trims the url after the first space (if any). And then it tries to download the trimmed tag instead of the original one.

One example would be http://naughtynicegirl69.tumblr.com/tagged/my%20gif
Expected behaviour:

  • Create folder in rips location named "tumblr_naughtynicegirl69_tag_my_gif"
  • Download all images within that tag "my gif".

Current behaviour:

  • Create folder in rips location named "tumblr_naughtynicegirl69_tag_my" (trimmed to the space)
  • Download all images within the trimmed tag "my"

This happens whether the space in the given url is "%20" or an actual space " "

Deviantart picture numbering

Right now they are numbered starting at 001 for the newest piece in a gallery, and while it is a really minor problem, I could see it being pretty nice having them numbered starting with the last (oldest) picture.

Thoughts?

Multiple attempts to retry after 404 error

While retries are probably good for some errors, it seems fairly plain that a 404 error should be considered non-recoverable. However, ripme continues to retry, up to the retry limit, for non-existant files. This feels like a mistake.

Scheduling Rerip all or schedule task to call feature

First of all, this tool is awesome. If there's anywhere I can donate to the developer, please let me know. I'd be delighted to do so. Thanks for all the effort you put in.

Feature request would be to have some method to schedule rerip all. I poked at it for a couple of minutes, but my lack of code knowledge made it too challenging for me. I was thinking I might be able to call the feature from a script on my machine, but couldn't figure that out either.

Thanks!

Retrieve album captions and replace folder names with them if available

Example: http://www.imagefap.com/pictures/4649440/Frozen-(Elsa-and-Anna)
Make folder name "Frozen (Elsa and Anna)" instead of "imagefap_4649440".
Same with imgur and where applicable.

Alternatively: "Frozen (Elsa and Anna) [imagefap_4649440]", or a setting to change the behaviour.

The same thing might be applied to filenames (imgur gallery images sometimes have captions).
Imgur example: http://imgur.com/a/bXQpH

To preserve the original web-link names, maybe return the per-rip logs from the web-version of the ripper for every rip with lines like
Retrieved: Original URL
Saved as: rips\album caption\01_file caption

Also a reminder to sanitize caption inputs from captions that will break the filesystem ( \ / ? * " < > | : for NTFS) if this issue is to be considered.

Freezing when using Re-rip all

Hey There,

Props for this awesome little program. I'm having a problem where when using the Re-rip all command (for about 86 Reddit user links) the program will hang at some point. It seems to be related to the Log view as when it crashes the bottom of the Log view goes squint (see image). If I close the Log view by clicking on the button whenever it pops ups I can successfully rip the entire list.

Cheers!

screen shot 2014-04-21 at 10 18 50
screen shot 2014-04-21 at 10 25 58

Sometimes photos aren't renamed after download completes

Most of the album will be correctly downloaded, but a couple of images are named like 013_image.php (which seems an odd choice for a default extension for downloads anyway). Changing extension to .jpg shows that the picture was properly downloaded, just not renamed.

Possibly a thread locking or concurrency issue?

I'm on Windows 7 btw

fetch Huge Thumbnail instead of original images from imgur

ripme fetch images ending with h instead of the original one
Problems is that h is smaller resolution[1]

log
http://0bin.net/paste/DrMqm9pEYQTzRJ3+#wRLWhmMQ9fufGXUV72rSyqBzD9+lEXtaAHWX/qb/P7M=
sfv diff
http://cryptb.in/Z2#c7612971639c0e80daa22fb669fb2b72

[1] http://api.imgur.com/models/image

There are 6 total thumbnails that an image can be resized to. Each one is accessable by appending a single character suffix to the end of the image id, and before the file extension. The thumbnails are:
...
h Huge Thumbnail 1024x1024 Yes"

Large albums not fully ripped

When ripping large imgur albums (eg imgur.com/r/wallpapers ), (several hundred images), the full album is often not fully ripped. Presumably related to imgur page loading (56 images/page) and/or the occasional "load more images".
This may be related to ripping on relatively low bandwidth connection.
If re-rip is attempted with overwrite disabled, sometimes the successive rip attempts will rip additional imgur pages.

Request sites to support here

PLEASE include an example URL of the type of album to rip.


Supported sites + commits:


Pending sites + examples (NSFW):


Enhancement requests:

  • flickr - login support (see #32)
  • vk - Video ripper rename files
  • twitter - Search ripping
  • gifyo - Don't skip some images
  • *chan rippers - Handle external links (vocaroo), store comments
  • tumblr - Rip by date (/archive/year/month)

Skipping http://i.imgur.com/VpStI.jpg -- already attempted

This is not a bug, just a thing I think should be corrected:

I was downloading this:
goneabitwild.imgur.com
but this happened:

2014-04-29 16:48:37,953 22945 [pool-1-thread-2] INFO ripper.DownloadFileThread - Downloading file: http://i.imgur.com/VpStI.jpg
2014-04-29 16:48:38,347 23339 [pool-1-thread-2] INFO ripper.DownloadFileThread - [+] Saved http://i.imgur.com/VpStI.jpg as ./rips/imgur_goneabitwild/8OUfg/002_VpStI.jpg
2014-04-29 16:48:47,004 31996 [Thread-6] INFO ripper.AbstractRipper - [!] Skipping http://i.imgur.com/VpStI.jpg -- already attempted: ./rips/imgur_goneabitwild/KTOKb/003_VpStI.jpg

I don't think VpStI.jpg should be skipped in /a/KTOKb

thanks

Program becomes unresponsive when ripping extremely large (15,000+) galleries.

RipMe seems to slow down and become completely unresponsive when:

  • downloading galleries larger than 15,000 images
  • using Re-Rip all on a number of galleries that total more than 79,850 images.

It seems once ripme.log reaches a filesize of 20MB, it stops updating the log, and soon after, stops responding.

Keep in mind, I've only tested this with Tumblr, because I've only been able to find galleries of that size there.

Jars with spaces in folder/file name don't work

2014-04-06 01:07:16,448 1438 [AWT-EventQueue-0] DEBUG utils.Utils - 
    jarPath = /C:/test%20folder/ripmes.jar 
2014-04-06 01:07:16,448 1438 [AWT-EventQueue-0] ERROR utils.Utils - 
    Error while loading jar file: java.io.FileNotFoundException: C:\test%20folder\ripmes.jar (The system cannot find the path specified)

Queue system

Clipboard autorip doesn't work if ripper is already ripping an album.

Rip Queue could be another tab (like "History") showing upcoming downloads.

UI is already pretty busy, though..

Shared album discovery

Best part of the ripper site was seeing what other people were ripping.

That's gone in the stand-alone client-side version.

Possibilities:

  • RipMe tells a server what albums it's downloading; provides URLs to 4 sample images to be thumbnailed.
  • Server provides API to get most-recently requested albums and their thumbnails.
  • User can decide to re-rip the album to their computer to view the rest.

Not sure how this would fit into the existing UI... seems like a separate window is appropriate.

Ability to label rip folders

It would be nice to have the option to add a string of text to either the beginning or end of the file rip folder (top help with organizing/labeling/tagging)

u/micah376

Imgur albums with subdomain names

I'm having trouble ripping imgur albums with a subdomain format, for instance: http://goneabitwild.imgur.com/
This issue doesn't occur with imgur albums with the format imgur.com/a/#####

Not sure if there is a problem on my end, but I've confirmed the issue with multiple examples of both types of albums. In case its a bug I thought it was worth reporting.

Support for old imgur URL format

Old imgur urls were formatted like so.. http://dayah.imgur.com/kapow
Clicking through manually, the new url is http://imgur.com/a/gKA9V

Support imagefap /gallery/ Links

Please add support for the /gallery/ links.

http://www.imagefap.com/gallery/4753165
http://www.imagefap.com/pictures/4753165/exilevilify---reddit-gonewild%21

Usernames with - or _

Usernames with - and _ aren't ripped properly. .

Likely regex/replacement is causing the issue, or the filesystem saves (?)

Only rips from gonewild.rarchives?

If you try to rip a user that is not found on gonewild.rarchives it wont rip anything. Just a blank folder. Why wouldn't it rip like the old web based and pull from the profile? Or am I missing something

2014-04-23 12:58:01,512 5901 [Thread-2] ERROR rippers.GonewildRipper  - Error while retrieving user posts:Traceback (most recent call last):
  File "api.cgi", line 213, in <module>
    print dumps(main(), indent=2)
  File "api.cgi", line 29, in main
    elif method == 'get_user':  return get_user(keys)
  File "api.cgi", line 72, in get_user
    count   = int(keys.get('count', 10))
  File "/home/rarchivers/gonewild.rarchives.com/py/Queries.py", line 307, in get_user_posts
    userid = db.select_one('id', 'users', 'username = ?', [user])
  File "/home/rarchivers/gonewild.rarchives.com/py/DB.py", line 213, in select_one
    return one[0]
TypeError: 'NoneType' object is unsubscriptable

2014-04-23 12:58:01,513 5902 [Thread-2] INFO  ripper.AbstractRipper  -    Rip completed!

Auto-update renaming

After auto-updating 1.0.11 to 1.0.12 the executable file did not rename itself from ripme.jar.update, had to rename it manually.

Generic timeouts and retries for retrieving pages from all rippers

Sometimes a page doesn't load, or takes a while. Retry logic when this happens could be useful.

Caveats:

  • Each ripper has it's own unique settings for retrieving the page (Jsoup's get() or execute(), cookies, referrers, useragents)
  • All existing rippers will need to be changed to use the generic getter (and tested)

ehentai ripper throwing No Protocol error

I keep getting a "No Protocol" error when trying to use the eHentai ripper

2014-05-13 16:07:14,355 44573 [Thread-8] INFO  ripper.AbstractRipper  - Deleting empty directory /home/Random/rips/e-hentai_Artist_Coco
2014-05-13 16:07:14,968 45186 [AWT-EventQueue-0] INFO  ripper.AbstractRipper  - [+] Creating directory: ./rips/e-hentai_Artist_Coco
2014-05-13 16:07:14,968 45186 [AWT-EventQueue-0] DEBUG ripper.AbstractRipper  - Set working directory to: /home/Random/rips/e-hentai_Artist_Coco
2014-05-13 16:07:14,968 45186 [AWT-EventQueue-0] DEBUG ripper.DownloadThreadPool  - Initializing Main thread pool with 10 threads
2014-05-13 16:07:14,969 45187 [Thread-9] INFO  rippers.EHentaiRipper  -     Retrieving http://g.e-hentai.org/g/######/##########/
2014-05-13 16:07:15,097 45315 [Thread-9] ERROR ripper.AbstractRipper  - Got exception while running ripper:
java.net.MalformedURLException: no protocol: 
    at java.net.URL.<init>(Unknown Source)
    at java.net.URL.<init>(Unknown Source)
    at java.net.URL.<init>(Unknown Source)
    at com.rarchives.ripme.ripper.rippers.EHentaiRipper.rip(EHentaiRipper.java:96)
    at com.rarchives.ripme.ripper.AbstractRipper.run(AbstractRipper.java:290)
    at java.lang.Thread.run(Unknown Source)
2014-05-13 16:07:15,097 45315 [Thread-9] INFO  ripper.AbstractRipper  -    Rip completed!
2014-05-13 16:07:15,097 45315 [Thread-9] INFO  ripper.AbstractRipper  - Deleting empty directory /home/Random/rips/e-hentai_Artist_Coco

Flickr Rip Support Limited

Hello,

The homepage says that Flickr is one of the supported sites for the ripme program. Some pages are still unable to be ripped. For instance I tried ripping three types of flickr pages and none seem to work.

Tagged Flickr page: https://www.flickr.com/photos/8189261@N05/tags/raych/
Album/Sets Flickr page: https://www.flickr.com/photos/8189261@N05/sets/72157628866931619/with/7052912871/
User Flickr page: https://www.flickr.com/photos/8189261@N05

Flickr pages with numbers like 8189261@N05 instead of usernames don't work at all. Only flickr pages with names in the url such as https://www.flickr.com/photos/constaza/ work. Unfortunately, the program doesn't rip albums or tagged pics in usernames though such as https://www.flickr.com/photos/katj2day/sets/72157638291120306/ -or- https://www.flickr.com/photos/katj2day/tags/sexy/. This can be troublesome if a user has thousands of pics and you only want to get an album or tagged pics.

If it's possible to add numbered users like 8189261@N05, album/set rip capability, and tags rip capability, that'd be fantastic.

Irregardless of all this, I love the program so far and look forward to it getting better everyday. Appreciate all the effort.

Deviant art limitations

Nice app. However it seems it can rip only the 1st page of a multipage gallery in Deviant art.

Keep up the good work.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.