Commit Graph

2462 Commits

Author SHA1 Message Date
Mike Fährmann
e2fc4eaa6f [deviantart] detect stash folders (fixes #659) 2020-04-01 01:59:03 +02:00
Mike Fährmann
c034159701 [piczel] fix extraction for single images 2020-03-31 22:47:23 +02:00
Mike Fährmann
699036ea0c [weibo] accept status URLs with non-numeric IDs (#664) 2020-03-31 22:46:50 +02:00
Mike Fährmann
a0111edfd2 change Travis badge URL to .com 2020-03-31 22:23:53 +02:00
Mike Fährmann
fe96f99e4b [hentainexus] reduce line length (flake8) & update test 2020-03-31 22:08:43 +02:00
墨焓
6f81cac8fa Add metadata to hentainexus: circle, event, title_conventional. (#661) 2020-03-31 21:59:02 +02:00
Mike Fährmann
3ed72f82dc release version 1.13.3 2020-03-28 22:03:33 +01:00
Mike Fährmann
6f911aeb1c [deviantart] add error message for cloudFront blocks (#655) 2020-03-28 21:18:04 +01:00
Mike Fährmann
7499d71d02 [simplyhentai] ignore certificate errors in video test 2020-03-28 21:07:30 +01:00
Mike Fährmann
4203dc0bdc [mangapark] fix metadata extraction 2020-03-28 03:00:26 +01:00
Mike Fährmann
6ecb0a19cf handle sys.stdin being None when using '-' as input file (#653) 2020-03-25 22:33:39 +01:00
Mike Fährmann
c8db2a87e9 fix create_test_data.py script 2020-03-25 22:06:38 +01:00
Mike Fährmann
1b82d36ab2 [deviantart] handle decode errors for extended_fetch results (#655)
This isn't going to solve the underlying problem, but it should at
least provide the server response when those errors happen.
2020-03-24 20:56:41 +01:00
Mike Fährmann
88ebbadc58 remove dashes from subcategory names in supportedsites.rst 2020-03-24 02:49:00 +01:00
Mike Fährmann
09f2271528 [35photo] add 'tag' extractor 2020-03-24 02:49:00 +01:00
Mike Fährmann
77fda8190c [35photo] simplify/remove tests for the 'genre' extractor
There is still a nice genre overview page (https://35photo.pro/genre/)
but the individual sub-pages don't list photos anymore
2020-03-24 02:48:25 +01:00
Mike Fährmann
4bc161ca0f prevent crash when sys.stdout and co. are None (#653) 2020-03-23 23:38:55 +01:00
Mike Fährmann
d47d0f757c [travis] allow 'results' and 'snap' tests to fail 2020-03-23 23:06:21 +01:00
Mike Fährmann
ce73796eaa [travis] add flake8 job 2020-03-23 22:59:01 +01:00
Mike Fährmann
fb846c9ee5 [instagram] reduce line lengths and make flake8 happy 2020-03-23 22:56:43 +01:00
Mike Fährmann
ad2efa8509 [e621] derive from Danbooru extractors (#651)
- use extractor implementations from 'danbooru'
- use "page": "b[ID]" to paginate over results instead of
  "tags": "id:<[ID]", avoiding infinite loops with certain
  post orders
- bump User-Agent version
2020-03-22 21:08:45 +01:00
Mike Fährmann
9b39e1cd7e [e621] fix bug in API rate limiting (#651) 2020-03-22 14:01:23 +01:00
Mike Fährmann
b607d0ad7f [twitter] fix typo in 'x-twitter-auth-type' header (#625) 2020-03-21 23:11:39 +01:00
Mike Fährmann
9159cb8fb3 remove trailing dots and spaces from directory names (#647) 2020-03-19 21:12:18 +01:00
Mike Fährmann
2c3b9e1450 [nozomi] support multiple images per post (#646)
This changes the default filename format as well as archive IDs,
since those assumed that each post would only have one image.
2020-03-19 21:07:31 +01:00
Mike Fährmann
c606d0c854 [instagram] update pattern for user profile URLs
Allow for query parameters and fragments,
for example https://www.instagram.com/instagram/?hl=en
2020-03-18 22:24:20 +01:00
Mike Fährmann
2530db3f4d [mangadex] transform 'date' timestamps to datetime objects 2020-03-18 02:19:19 +01:00
Mike Fährmann
ae2a33243b [newgrounds] catch general Exceptions 2020-03-18 02:17:43 +01:00
Mike Fährmann
32e36d8f02 [sexcom] replace tests 2020-03-17 22:47:45 +01:00
Mike Fährmann
33b42dc847 [nozomi] sort search results (fixes #646) 2020-03-17 22:28:23 +01:00
Mike Fährmann
eaa60a438b [piczel] fix extraction
- manually filter by folder_id
- extract data for single posts from embedded JSON, since the
  '/api/gallery/image/<id>' endpoint is no longer available
2020-03-17 17:12:28 +01:00
Mike Fährmann
5bcc7184c9 [danbooru][e621] increase page limits 2020-03-17 15:53:28 +01:00
Mike Fährmann
90d15e3682 [instagram] use 'itertools.chain()' 2020-03-17 15:52:44 +01:00
Leonardo Taccari
160328d21c [instagram] Add support for user's saved medias (#644)
* [instagram] Gracefully handle possible 'HttpErrorPage' in _extract_page()

`HttpErrorPage' is returned in shared_data at least  when not authenticated or
when trying to fetch other users saved medias
(i.e. `instagram.com/<user>/saved/').

Gracefully handle it by returning nothing.

* [instagram] Add support for user's saved medias

(Please note that this need the user to be authenticated and they can
only see their saved media (not other users ones).)

Close #643.

* [instagram] Bump copyright year
2020-03-16 21:09:14 +01:00
Mike Fährmann
e0b0e8d62a release version 1.13.2 2020-03-14 20:19:36 +01:00
Mike Fährmann
5b676ea59d [e621] document username & password support (#640) 2020-03-14 01:15:22 +01:00
Mike Fährmann
1b3ba86110 improve lists in man pages 2020-03-14 01:08:31 +01:00
Mike Fährmann
d3482ace7f [furaffinity] extract more metadata
- views
- favorites
- comments
- rating
- fa_category (since 'category' is already in use)
- theme
- species
- gender
- width
- height
2020-03-13 23:56:55 +01:00
Mike Fährmann
f6c5edb76b pre-compile regex pattern for remove_html() and split_html() 2020-03-13 23:31:54 +01:00
Mike Fährmann
fdd2dd5136 [kabeuchi] add 'user' extractor (closes #561) 2020-03-13 16:45:42 +01:00
Mike Fährmann
59edcdc822 [hitomi] restore metadata fields from before f33b13a
... and add a 'metadata' option to disable
visiting the gallery page and extracting data from it
if this is not needed.
2020-03-12 23:43:41 +01:00
Mike Fährmann
2d5703c493 [twitter] use a simpler data structure to store cookies in cache
Use a dict with name-value pairs instead of an entire
RequestsCookieJar object.
2020-03-12 22:02:12 +01:00
Mike Fährmann
87d4f83597 [newgrounds] make post extraction nonfatal 2020-03-10 01:49:59 +01:00
Mike Fährmann
823fbeaae6 [newgrounds] add 'favorite' extractor (#394) 2020-03-10 01:07:09 +01:00
Mike Fährmann
a45fbc38ea [pixiv] implement 'avatar' option (#595, #623) 2020-03-09 21:18:16 +01:00
Mike Fährmann
a63a376ad2 [mangoxo] fix login 2020-03-08 23:01:51 +01:00
Mike Fährmann
ebc70e87ce [e621] update to new interface / API endpoints (closes #635) 2020-03-06 21:12:58 +01:00
Mike Fährmann
d1cf7ccdb3 [instagram] add 'post_shortcode' metadata field (#525) 2020-03-06 15:20:32 +01:00
Mike Fährmann
402025c3c3 fix some build issues
- use 'os.name' to decide between Windows/Linux build
- don't check Windows executable version number,
  since Wine fails to run the executable and causes release.sh to stop
2020-03-06 15:15:59 +01:00
Mike Fährmann
32df8d06fe [twitter] add 'bookmark' extractor (closes #625) 2020-03-06 01:20:04 +01:00