Commit Graph

2673 Commits

Author SHA1 Message Date
Mike Fährmann
c45770331a use 'str.partition()'
The (r)partition method is always faster then split() or any other
method that has been replaced in this commit.
2017-08-21 18:29:50 +02:00
Mike Fährmann
017a72f448 [pixiv] improve input validation 2017-08-21 17:53:27 +02:00
Mike Fährmann
dcf42c5e89 [pixiv] add extractor for ranking lists 2017-08-20 20:21:52 +02:00
Mike Fährmann
4ea82ea556 [warosu] add thread extractor 2017-08-18 19:54:07 +02:00
Mike Fährmann
9aa95fba8c [deviantart] adapt download URLs to use https
Even though DeviantArt is "completely switching over to HTTPS"[1],
every URL contained in an API response is still using HTTP

[1] https://danlev.deviantart.com/journal/DeviantArt-Is-Switching-To-HTTPS-697996906
2017-08-16 12:17:50 +02:00
Mike Fährmann
02e89700fc [foolfuuka] ensure sorted posts 2017-08-13 14:29:26 +02:00
Mike Fährmann
8bcf88bff7 [flickr] fix extraction
This issue was only noticeable with older Python versions, as these
don't exhibit a consistent ordering of dict keys.
2017-08-12 21:41:10 +02:00
Mike Fährmann
004456d5d5 properly update the config-dictionary
When using 2 or more config files, the values of the second would
improperly overwrite nested dictionaries of the first one.
The new method properly combines these nested dictionaries as well.
2017-08-12 20:07:27 +02:00
Mike Fährmann
cfa479fab5 update error message for unspecified exceptions
- ask user to report unexpected errors, which usually indicate
  extractor failure
- handle OSErrors separately (permissions, disk full, etc)
- revert 30eef52
2017-08-10 16:35:46 +02:00
Mike Fährmann
7e936e9c06 [luscious] simplify and remove dead code 2017-08-08 19:26:13 +02:00
Mike Fährmann
0245a0ba5f fix extraction and update test results
- fixes for hbrowse, imgyt, imgcandy, hosturimage
- test updates for deviantart, gfycat
2017-08-08 19:11:13 +02:00
Mike Fährmann
abd7c559cd [yonkouprod] remove module
Every manga chapter on this site has been removed.
2017-08-07 18:32:14 +02:00
Mike Fährmann
da7219ba74 [kisscomic] remove module
Image links on this site are dead.
2017-08-07 18:28:35 +02:00
Mike Fährmann
852e7acd31 [twitter] ignore "Promoted Tweets" 2017-08-06 13:43:08 +02:00
Mike Fährmann
915a0137de improve 'extractor.request'
- add 'fatal' argument
- improve internal logic and flow
- raise known exception on error
- update exception hierarchy
2017-08-05 16:11:46 +02:00
rachmadani haryono
dcd573806e chg: dev: fix error (#32)
* fix: dev: error

* fix: dev: AttributeError when getting artist

* fix: dev: typo on luscious parser
2017-08-04 15:01:10 +02:00
Mike Fährmann
c4713404c8 [directlink] improve URL pattern 2017-08-02 21:06:49 +02:00
Mike Fährmann
d443822fdb [luacious] get correct image URLs (fixes #33)
Instead of using thumbnail URLs and modifying them the extractor now
goes through every single image-page and gets its download URL from
there.
2017-08-02 19:58:13 +02:00
Mike Fährmann
6950708e52 [hentaicdn] use HTTPS 2017-08-02 18:31:21 +02:00
Mike Fährmann
4f1e6c109f [deviantart] remove 'invalid escape sequence' warning
- use r"\w" or "\\w" instead of "\w"
2017-07-27 20:50:33 +02:00
Mike Fährmann
c864be479e [directlink] update URL pattern & PEP 8
- combine some file extensions
- don't match '.je'
- line length < 80
2017-07-27 20:46:15 +02:00
H R X N
45f9d64c23 Update directlink.py with additional file exts. (#30)
Add WebP, still not that common, but it's increasing.
Add 3rd JPEG variant (https://en.wikipedia.org/wiki/JPEG#JPEG_filename_extensions)
Never seen JFIF in the wild, would probably be overkill.
Extend Ogg formats (https://en.wikipedia.org/wiki/Ogg; https://wiki.xiph.org/MIME_Types_and_File_Extensions)
2017-07-27 20:40:00 +02:00
Mike Fährmann
4357966a70 [kissmanga] make URL pattern case-insensitive (fixes 28) 2017-07-26 10:36:59 +02:00
Mike Fährmann
7aa9fa796a code cleanup and fixes 2017-07-25 14:59:41 +02:00
Mike Fährmann
f08af03845 Merge branch 'cookies' 2017-07-25 14:04:53 +02:00
Mike Fährmann
55f048d02b ignore case of cookiejar magic strings 2017-07-24 18:33:42 +02:00
Mike Fährmann
f53bf1a323 [thebarchive] add thread extractor 2017-07-23 15:45:17 +02:00
Mike Fährmann
b8cf434bb0 [rebeccablacktech] add thread extractor 2017-07-23 15:41:56 +02:00
Mike Fährmann
808f67ba7d use 'cookiedomain' for cookies set by object-config-values
otherwise these cookies would not be picked up by the
_check_cookies() method.
2017-07-22 15:43:35 +02:00
Mike Fährmann
390eeded4c [mangazuki] support 'raws.…' subdomain 2017-07-21 16:25:56 +02:00
Mike Fährmann
4a60f6068a [mangazuki] add manga extractor 2017-07-20 16:02:09 +02:00
Mike Fährmann
394241cd6f [2chan] fix extraction 2017-07-20 15:01:47 +02:00
Mike Fährmann
a13eb6010f [fallenangels] fix extraction of chapter URLs 2017-07-20 14:58:47 +02:00
Mike Fährmann
1cb1d2e0a3 [mangazuki] add chapter extractor 2017-07-19 17:20:03 +02:00
Mike Fährmann
2f2e363c97 [imgur] use /a/<key>/all as album-url 2017-07-18 21:06:31 +02:00
Mike Fährmann
1cec03c9c6 [imgur] fix extraction of large albums 2017-07-18 12:42:19 +02:00
Mike Fährmann
0610ae5000 skip login if cookies are present 2017-07-17 10:33:36 +02:00
Mike Fährmann
f105782435 [fireden] add thread extractor 2017-07-15 14:51:58 +02:00
Mike Fährmann
c93f7d7496 [archiveofsins] add thread extractor 2017-07-15 13:23:04 +02:00
Mike Fährmann
96e13604da [archivedmoe] add thread extractor 2017-07-14 13:25:53 +02:00
Mike Fährmann
30d3a5f9b2 support redirects on 4chan archives 2017-07-14 13:24:09 +02:00
Mike Fährmann
98464d1f1b [loveisover] add thread extractor 2017-07-14 11:17:47 +02:00
Mike Fährmann
47692f28da [2chan] add thread extractor 2017-07-14 08:44:31 +02:00
Mike Fährmann
3460dc8950 update gallery-dl.conf 2017-07-14 08:23:11 +02:00
Mike Fährmann
9be8f7e106 [deviantart] add "extractor.deviantart.flat" option
Setting this to 'false' downloads images into individual subdirectories
for each gallery-folder or favourite-collection, otherwise it is just
creating a flat list of images.
2017-07-12 17:05:31 +02:00
Mike Fährmann
d075627fd9 [deviantart] support group galleries (#26)
For groups the 'GalleryExtractor' collects all gallery-folder URLs
and defers its work to the 'FolderExtractor'.
2017-07-12 09:47:01 +02:00
Mike Fährmann
b37a62501b [pixiv] unquote tags 2017-07-12 08:21:29 +02:00
Mike Fährmann
fbd7dcdfdb [desuarchive] add thread extractor 2017-07-11 17:48:22 +02:00
Mike Fährmann
af9bd17b19 [deviantart] adjust default paths
- user.deviantart.com/(gallery|favourites|journal)/ images go into
  * <user>/
  * <user>/Favourites/
  * <user>/Journal/
  (having an extra "Gallery" folder for a user's gallery-images seems
   a bit too much if these are all you want to download, which is
   probably the default use-case)

- single "deviations" (user.deviantart.com/(art|journal)/name-123) go
  into their owner's directory:
  * <user>/
  (putting them into their own directory seems weird in practice)
2017-07-10 18:54:10 +02:00
Mike Fährmann
eb64fb267c [nyafuu] add thread extractor (#18) 2017-07-08 17:16:41 +02:00