Commit Graph

3640 Commits

Author SHA1 Message Date
Mike Fährmann
368f156378 [pixiv] rankings: add support for the new daily AI and daily AI R18
(#3214, #3221)

In remembrance of @thatfuckingbird
2022-11-15 11:47:57 +01:00
Mike Fährmann
6c153750fa [nitter] add extractors for Nitter instances (#2696) 2022-11-15 11:44:16 +01:00
Mike Fährmann
9f06e79868 implement '"user-agent": "browser"' (#2636) 2022-11-13 19:17:39 +01:00
enduser420
ade9789b3e [mangaread] update regex 2022-11-13 16:04:17 +05:30
enduser420
039d06c8f6 [mangaread] add 'chapter' and 'manga' extractors 2022-11-13 16:00:34 +05:30
Mike Fährmann
70c7fbe89a [instagram] add 'guide' extractor (#3192) 2022-11-11 20:21:41 +01:00
enduser420
93ea8ca8e3 [imxto] extract additional metadata (#3175) 2022-11-11 08:53:07 +01:00
Mike Fährmann
e3abab8629 [weibo] send 'Referer' headers (#3188) 2022-11-10 17:11:57 +01:00
Mike Fährmann
6423f990de [realbooru] fix 'tags' extraction (#2530) 2022-11-10 17:04:02 +01:00
Mike Fährmann
ecad02cf3f [realbooru] fix download URLs (#2530) 2022-11-10 13:29:35 +01:00
Mike Fährmann
15cd114c9c [twitter] update bookmarks pagination (#3172)
Do not stop when there aren't any tweets in a batch,
but only when the same cursor value appears twice in a row.
2022-11-09 20:40:51 +01:00
Mike Fährmann
20fbba9d7c [exhentai] add metadata to search results (#3181)
'gallery_id' and 'gallery_token'
2022-11-09 11:40:27 +01:00
Mike Fährmann
6a0c5e34f4 [exhentai] fix pagination (#3181) 2022-11-09 11:15:34 +01:00
Mike Fährmann
171262c1b6 [instagram] remove login support
broken feature that I cannot get to work anymore
2022-11-08 21:48:31 +01:00
Mike Fährmann
93e6bd6847 [uploadir] use utf-8 filenames (#3162) 2022-11-05 19:03:48 +01:00
Mike Fährmann
b7a83ac726 [uploadir] update (#3162)
- prevent extra HTTP request from redirects
- add 'id' metadata field
- set 'filename_fmt' and 'archive_fmt'
2022-11-05 18:28:35 +01:00
Mike Fährmann
ccb80f1b8b [uploadir] add support for 'uploadir.com' (#3162) 2022-11-05 14:25:09 +01:00
Mike Fährmann
b0cb4a1b9c replace 'text.extract()' with 'text.extr()' where possible 2022-11-05 01:14:09 +01:00
Mike Fährmann
4fd3c893fa [booru] adjust/match '_tags' and '_notes' code 2022-11-04 19:49:39 +01:00
Mike Fährmann
88954aa2e4 [gelbooru_v02] implement 'notes' extraction
same code as for 'moebooru' works here as well
2022-11-04 19:49:39 +01:00
ClosedPort22
4e80d3210e [tumblr] Fallback to gifv when possible (#3095) (#3159) 2022-11-04 19:42:36 +01:00
thatfuckingbird
9d3f86dbcd [twitter] update URL for syndication API (#3160)
Twitter changed the URL format to access tweet data through their syndication API.
2022-11-04 17:49:45 +01:00
enduser420
c01cad599a [lolisafe] add support for xbunkr (#3156) 2022-11-04 16:13:52 +01:00
Allen
9fc142d27b [mastodon] add "remote_instance" field (#3119)
Example Usage:
If the url is "mastodon:https://mastodon.example.org/@VoteChess@botsin.space the "remote_instance" will be "botsin.space"
...
"directory": ["mastodon", "{remote_instance|instance}", "{account[username]!l}"]
...
2022-11-02 17:09:38 +01:00
Mike Fährmann
2a1cb403ee Revert "[Deviantart] [#1776] Remove the "you need session cookies to download mature scraps" warning (#1777)"
This reverts commit 1f02878351.

Mature scraps do yet again require cookies.
2022-11-01 21:23:05 +01:00
Mike Fährmann
86790da2d5 update Cloudflare IUAM detection
again
2022-10-31 18:33:52 +01:00
Mike Fährmann
775895f44b [booru] refactor 'tags' and 'notes' extraction
- move HTML request for post pages into its own function
- move gelbooru_v02.py notes extraction to gelbooru.py
  since it only works there
- clean up some code
2022-10-31 12:01:19 +01:00
Luc Ritchie
0f9dfb7e62 [instagram] Fix AttributeError on user stories extraction (#3123) 2022-10-30 11:47:06 +01:00
Mike Fährmann
f81dd5297a [skeb] fix extraction (#3112)
'completed_at' is no longer included in API responses
2022-10-29 15:07:21 +02:00
enduser420
fb2dbb04e2 [moebooru] extract 'notes' (#3094) 2022-10-28 18:05:54 +02:00
Mike Fährmann
4e26bf98f5 [aibooru] support 'safe' subdomain (#3110) 2022-10-28 12:10:18 +02:00
Mike Fährmann
5c31791b3c [mastodon] support '/web/' URLs (#3109) 2022-10-28 11:47:00 +02:00
Mike Fährmann
9a2cfd4421 [mastodon] support cross-instance user references (#3109) 2022-10-27 14:26:42 +02:00
Mike Fährmann
58d97188b4 [mastodon] add 'bookmark' extractor (#3109) 2022-10-26 21:28:50 +02:00
Mike Fährmann
46b64251eb [bcy] fix extraction (#3103)
- fix regex for non-watermarked images
- fetch data from '/item/detail' pages for all other posts,
  since '/apiv3/user/selfPosts' only has incomplete data
2022-10-26 15:33:07 +02:00
Mike Fährmann
77173694d5 [kemonoparty] fix 'dms' extraction (#3106) 2022-10-26 14:25:43 +02:00
Mike Fährmann
f168ec9572 [instagram] extract 'coauthors' metadata (#3107) 2022-10-26 14:11:29 +02:00
Mike Fährmann
7c6af27eb8 [tumblr] add 'fallback-*' options (#2957)
specifically 'fallback-delay' and 'fallback-retries'
and change default number of retries to 2 (down from 3)
2022-10-26 13:59:09 +02:00
Mike Fährmann
4aa56d500b [hentaihere] fix test results 2022-10-24 16:56:27 +02:00
Mike Fährmann
75d707fd92 [hentaihere] update
- support minor versions in chapter URLs
- fix manga metadata extraction
- update tests
2022-10-24 16:41:22 +02:00
Mike Fährmann
d2fc73f20b [hentai2read] fix manga metadata extraction
and update tests
2022-10-24 16:31:01 +02:00
Mike Fährmann
f4d06e5180 [manganelo] update domain to 'chapmanganato.com' (#3097) 2022-10-24 16:08:12 +02:00
Mike Fährmann
769e6754dc [pixiv] use 'exact_match_for_tags' as default search mode (#3092) 2022-10-24 16:08:12 +02:00
Mike Fährmann
a90e5cb354 [instagram] support 'instagram.com/s/' highlight URLs (#3076) 2022-10-24 16:08:12 +02:00
enduser420
fd19c4b228 [hentai2read] recognize '.' in chapter (#3089) 2022-10-24 15:53:51 +02:00
enduser420
2ff1897421 [vichan] recognize board url w/o trailing slash (#3087) 2022-10-24 15:52:54 +02:00
enduser420
ac6111e693 [mangasee] add support for 'mangalife' (#3086) 2022-10-21 16:46:36 +02:00
KJ16609
300bc03deb [gelbooru] allow alternate parameter order in post URLs (#2821) 2022-10-21 14:59:29 +02:00
Mike Fährmann
a7d23f1484 [vichan] add generic extractors for vichan imageboards
includes 8kun.top, smuglo.li, and wikieat.club
2022-10-21 14:40:45 +02:00
Mike Fährmann
04d3ebdfb4 [redgifs] fix 'token' extraction (#3080, #3081) 2022-10-21 13:27:16 +02:00