Mike Fährmann
e60962f7e5
[philomena] improve tag escapes handling ( fixes #1629 )
2021-06-16 18:47:08 +02:00
Mike Fährmann
d8908ca577
[unsplash] update collections URL pattern ( fixes #1627 )
2021-06-15 15:44:45 +02:00
Mike Fährmann
9ed13703cc
[sankaku] handle empty tags ( fixes #1617 )
2021-06-14 16:20:10 +02:00
Mike Fährmann
b56e245094
[500px] update GraphQL queries
...
500px changed its method from query hashes to sending the entire query
string for every request.
2021-06-14 16:13:08 +02:00
Mike Fährmann
a751afdfb3
[twitter] change some defaults
...
- 'retweets' option: true -> false
- 'quoted' option : true -> false
i.e. disable downloading tweets from other users' timelines by default
- search directory:
'["{category}", "Search", "{search}"]' ->
'["{category}", "{user[name]}"]'
i.e. change it to the same as other twitter extractors (#1308)
2021-06-11 21:26:11 +02:00
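To keep the behaviour from before the twitter defaults change above, the old values can be set explicitly. This is a minimal sketch that only builds the JSON text. It assumes gallery-dl's usual nested configuration layout (extractor, then category, then subcategory, then option). The placement of the search directory under a "search" key is an assumption and should be checked against the configuration documentation.

```python
import json

# Values taken from the commit message above: the defaults before the change.
# The nesting ("extractor" -> "twitter" -> "search") is an assumption about
# where these options live in a gallery-dl configuration file.
old_defaults = {
    "extractor": {
        "twitter": {
            "retweets": True,
            "quoted": True,
            "search": {
                "directory": ["{category}", "Search", "{search}"],
            },
        }
    }
}

# Serialize to JSON text that could be merged into an existing config file.
config_text = json.dumps(old_defaults, indent=4)
print(config_text)
```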
Mike Fährmann
4e4ca3c330
[deviantart] pin API version ( #1611 )
...
'/gallery/folders' in the newest version doesn't include subfolders.
It probably only needs the right query parameter to do so, but that
doesn't seem to be documented anywhere.
2021-06-11 00:19:44 +02:00
Mike Fährmann
d09bc5bd34
[subscribestar] improve attachment filenames ( #1609 )
2021-06-10 17:09:13 +02:00
Mike Fährmann
2986bf63bf
[mangafox] update URL pattern ( fixes #1608 )
...
also accept non-numeric volume labels, e.g. vTBD
2021-06-10 14:56:59 +02:00
Mike Fährmann
53dab5c289
[mangadex] revert chapter handling ( #1535 )
...
Spawn a new ChapterExtractor for each individual chapter
instead of handling them directly with a MangaExtractor.
Doing it that way broke too many features like
--chapter-filter, --chapter-range, --zip, etc.
2021-06-09 21:54:40 +02:00
Mike Fährmann
1197ee2c20
[mangadex] add extractor for a user's followed feed ( #1535 )
2021-06-08 02:45:36 +02:00
Mike Fährmann
07c8adbd8b
[mangadex] implement login with username & password ( #1535 )
2021-06-08 02:12:57 +02:00
Mike Fährmann
3e332eaf53
[mangadex] update to API v5 ( #1535 )
2021-06-07 01:29:42 +02:00
Mike Fährmann
04f4f9badb
[oauth] prevent exceptions when reporting errors ( #1603 )
2021-06-05 14:28:26 +02:00
Mike Fährmann
a3bf878329
[idolcomplex] improve and fix pagination ( #1601 )
...
always rely on the 'next-page-url' value and its query parameters
2021-06-04 20:31:08 +02:00
Mike Fährmann
e39c4633ba
[cyberdrop] b64decode -> a2b_base64
2021-06-03 02:53:55 +02:00
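The cyberdrop commit above swaps one standard-library decoder for another. The commit does not state the motivation. The sketch below only shows that both calls return the same bytes for well-formed input. The sample payload is made up for illustration.

```python
import base64
import binascii

# Illustrative payload, not data from the cyberdrop extractor.
encoded = base64.b64encode(b"gallery-dl").decode("ascii")

# Both functions accept an ASCII string and return the decoded bytes.
via_base64 = base64.b64decode(encoded)
via_binascii = binascii.a2b_base64(encoded)

print(via_base64 == via_binascii)
```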
Mike Fährmann
407627ec86
[foolfuuka] support 'archive.wakarimasen.moe' ( closes #1595 )
2021-06-02 15:45:43 +02:00
Mike Fährmann
78f89d2e61
[idolcomplex] fix pagination ( closes #1594 )
2021-06-02 15:32:46 +02:00
Mike Fährmann
52052a0e1a
[manganelo] update domain to 'manganato.com'
2021-05-28 17:59:21 +02:00
Mike Fährmann
c80b18a477
[weibo] extend 'retweets' option ( closes #1542 )
...
Setting 'retweets' to "original" will use metadata from the
original posts, and not from the retweeted ones.
2021-05-27 23:09:42 +02:00
Mike Fährmann
c0fa5058da
[kemonoparty] actually add a 'type' metadata field ( #1556 )
2021-05-27 22:11:31 +02:00
thatfuckingbird
264beb8556
recognize v2.mangapark URLs ( #1578 )
...
* recognize v2.mangapark URLs
* update mangapark root url to use the v2 subdomain
2021-05-26 14:58:50 +02:00
thatfuckingbird
e6811c7450
[pixiv] implement 'max-posts' option ( #1558 )
...
* implement max-rank for pixiv
* rename to max-posts and make more generic
2021-05-24 17:49:46 +02:00
Mike Fährmann
8a909e478d
[imagebam] fix extraction of NSFW images ( #1534 )
2021-05-22 21:41:44 +02:00
Mike Fährmann
b5affc62aa
[twitter] rename 'text-only' to 'text-tweets' ( #570 )
2021-05-22 21:41:12 +02:00
Mike Fährmann
724ca61f36
[twitter] add 'text-only' option ( #570 )
2021-05-22 17:01:49 +02:00
Mike Fährmann
8fd8126117
fix ISO 639-1 code for Japanese
...
"jp" -> "ja"
2021-05-22 16:07:04 +02:00
Mike Fährmann
2c60c7d798
[reactor] skip deleted/empty posts
2021-05-21 16:14:09 +02:00
Mike Fährmann
532ac79fb0
update extractor test results
2021-05-21 02:28:53 +02:00
Mike Fährmann
d7bc4a2b8b
[500px] update query hashes
2021-05-21 01:20:31 +02:00
Mike Fährmann
0f35aca728
[aryion] minor code updates
2021-05-19 23:46:33 +02:00
Mike Fährmann
2eb46452ad
[aryion] update 'needle' to not skip text posts ( fixes #1568 )
...
on "Latest Updates" pages
"class='thumb scrollthumb' href='/g4/view/" and
"class='thumb' href='/g4/view/" both end with
"thumb' href='/g4/view/"
2021-05-19 23:35:05 +02:00
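The suffix observation in the aryion commit above can be checked directly. A minimal sketch: the variable names and the helper are illustrative and not taken from the extractor's source.

```python
# The two anchor openings quoted in the commit message.
scroll_thumb = "class='thumb scrollthumb' href='/g4/view/"
plain_thumb = "class='thumb' href='/g4/view/"

# The shared suffix that the commit message identifies.
needle = "thumb' href='/g4/view/"


def contains_needle(markup: str) -> bool:
    """Return True if the shared suffix occurs anywhere in the markup."""
    return needle in markup


# Both variants end with the needle, so a search for it finds either kind.
print(scroll_thumb.endswith(needle), plain_thumb.endswith(needle))
```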
Mike Fährmann
4fc9668922
[imgur] update URL patterns ( #1561 )
2021-05-19 15:44:10 +02:00
Mike Fährmann
1eabfa5c7a
[pillowfort] implement login with username & password ( #846 )
2021-05-19 02:59:16 +02:00
Mike Fährmann
24dd10ac3c
[patreon] extract user defined 'tags' ( #1539 , closes #1540 )
2021-05-18 00:35:52 +02:00
Mike Fährmann
a7e4917ee1
[pillowfort] add 'inline' option ( #846 )
...
to support images present in a post's 'content',
but not listed in 'media'.
also separates the file hash present at the beginning
of each 'filename' into its own field.
2021-05-17 03:03:58 +02:00
Mike Fährmann
efa6cc8ec3
[pillowfort] add 'external' option ( #846 )
...
for links to external Twitter posts etc.
2021-05-17 01:46:42 +02:00
Mike Fährmann
394fbb5f56
[twitter] strip useless t.co links ( #1532 )
...
The 'full_text' of Tweets with media content usually ends with a t.co
link to itself. This commit removes those.
2021-05-17 00:20:29 +02:00
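One way to drop a trailing t.co link, as the twitter commit above describes, is a regular expression anchored at the end of the text. This is a sketch under that assumption and not gallery-dl's actual code. The helper name `strip_trailing_tco` and the pattern are hypothetical.

```python
import re

# Hypothetical pattern: one t.co link, plus any whitespace around it,
# only when it is the last thing in the text.
_TRAILING_TCO = re.compile(r"\s*https://t\.co/\w+\s*$")


def strip_trailing_tco(full_text: str) -> str:
    """Remove a single t.co link from the end of the text, if present."""
    return _TRAILING_TCO.sub("", full_text)


print(strip_trailing_tco("New artwork! https://t.co/AbC123xyz"))
```

Links that appear in the middle of the text are left untouched, because the pattern only matches at the end of the string.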
Mike Fährmann
41457dbb1b
[twitter] resolve t.co URLs in 'content' ( #1532 )
2021-05-15 18:52:37 +02:00
Mike Fährmann
2b5d80862e
[kemonoparty] add 'type' metadata field ( #1556 )
...
'file', 'attachment', or 'inline'
2021-05-15 01:13:41 +02:00
Mike Fährmann
17b0ccb071
[twitter] add missing retweet media entities ( fixes #1555 )
...
from the original tweets
2021-05-14 22:51:01 +02:00
Mike Fährmann
5eeaaee01d
[pixiv] add 'metadata' option ( #1551 )
2021-05-14 20:30:28 +02:00
Mike Fährmann
0717456b4e
[kemonoparty] add 'metadata' option ( closes #1548 )
...
to fetch creator names with an additional HTTP request
2021-05-14 19:56:49 +02:00
Mike Fährmann
36ed1efcfb
[pixiv] rename "noop" value for 'tags' option to "original"
...
(#1507)
2021-05-07 20:41:54 +02:00
Mike Fährmann
14f983eab6
[deviantart] use default ID when 'client-id' is None
2021-05-07 16:14:38 +02:00
Mike Fährmann
3e4ffb0821
[gelbooru] add extractor for '/redirect.php' URLs ( #1530 )
2021-05-07 15:34:53 +02:00
Mike Fährmann
5e54105ae4
[instagram] update query hashes
2021-05-06 19:15:18 +02:00
Mike Fährmann
b3ee10a7fb
[500px] update query hashes
2021-05-06 17:28:26 +02:00
Mike Fährmann
15b0241bbc
[imagebam] fix extraction
2021-05-06 16:47:36 +02:00
Mike Fährmann
38ae61edd4
[inkbunny] add 'favorite' extractor ( #1521 )
2021-05-04 19:28:48 +02:00
Mike Fährmann
577fffad5f
[nozomi] update 'archive_fmt' values for tag and search extractors
...
… so they actually work for posts with more than 1 file.
(fixes #1523)
2021-05-04 19:28:37 +02:00