Mike Fährmann
368f156378
[pixiv] rankings: add support for the new daily AI and daily AI R18
...
(#3214 , #3221 )
In remembrance of @thatfuckingbird
2022-11-15 11:47:57 +01:00
Mike Fährmann
6c153750fa
[nitter] add extractors for Nitter instances ( #2696 )
2022-11-15 11:44:16 +01:00
Mike Fährmann
9f06e79868
implement '"user-agent": "browser"' ( #2636 )
2022-11-13 19:17:39 +01:00
enduser420
ade9789b3e
[mangaread] update regex
2022-11-13 16:04:17 +05:30
enduser420
039d06c8f6
[mangaread] add 'chapter' and 'manga' extractors
2022-11-13 16:00:34 +05:30
Mike Fährmann
70c7fbe89a
[instagram] add 'guide' extractor ( #3192 )
2022-11-11 20:21:41 +01:00
enduser420
93ea8ca8e3
[imxto] extract additional metadata ( #3175 )
2022-11-11 08:53:07 +01:00
Mike Fährmann
e3abab8629
[weibo] send 'Referer' headers ( #3188 )
2022-11-10 17:11:57 +01:00
Mike Fährmann
6423f990de
[realbooru] fix 'tags' extraction ( #2530 )
2022-11-10 17:04:02 +01:00
Mike Fährmann
ecad02cf3f
[realbooru] fix download URLs ( #2530 )
2022-11-10 13:29:35 +01:00
Mike Fährmann
15cd114c9c
[twitter] update bookmarks pagination ( #3172 )
...
Do not stop when there aren't any tweets in a batch,
but only when the same cursor value appears twice in a row.
2022-11-09 20:40:51 +01:00
Mike Fährmann
20fbba9d7c
[exhentai] add metadata to search results ( #3181 )
...
'gallery_id' and 'gallery_token'
2022-11-09 11:40:27 +01:00
Mike Fährmann
6a0c5e34f4
[exhentai] fix pagination ( #3181 )
2022-11-09 11:15:34 +01:00
Mike Fährmann
171262c1b6
[instagram] remove login support
...
broken feature that I cannot get to work anymore
2022-11-08 21:48:31 +01:00
Mike Fährmann
93e6bd6847
[uploadir] use utf-8 filenames ( #3162 )
2022-11-05 19:03:48 +01:00
Mike Fährmann
b7a83ac726
[uploadir] update ( #3162 )
...
- prevent extra HTTP request from redirects
- add 'id' metadata field
- set 'filename_fmt' and 'archive_fmt'
2022-11-05 18:28:35 +01:00
Mike Fährmann
ccb80f1b8b
[uploadir] add support for 'uploadir.com' ( #3162 )
2022-11-05 14:25:09 +01:00
Mike Fährmann
b0cb4a1b9c
replace 'text.extract()' with 'text.extr()' where possible
2022-11-05 01:14:09 +01:00
Mike Fährmann
4fd3c893fa
[booru] adjust/match '_tags' and '_notes' code
2022-11-04 19:49:39 +01:00
Mike Fährmann
88954aa2e4
[gelbooru_v02] implement 'notes' extraction
...
same code as for 'moebooru' works here as well
2022-11-04 19:49:39 +01:00
ClosedPort22
4e80d3210e
[tumblr] Fallback to gifv when possible ( #3095 ) ( #3159 )
2022-11-04 19:42:36 +01:00
thatfuckingbird
9d3f86dbcd
[twitter] update URL for syndication API ( #3160 )
...
Twitter changed the URL format to access tweet data through their syndication API.
2022-11-04 17:49:45 +01:00
enduser420
c01cad599a
[lolisafe] add support for xbunkr ( #3156 )
2022-11-04 16:13:52 +01:00
Allen
9fc142d27b
[mastodon] add "remote_instance" field ( #3119 )
...
Example Usage:
If the url is "mastodon:https://mastodon.example.org/@VoteChess@botsin.space the "remote_instance" will be "botsin.space"
...
"directory": ["mastodon", "{remote_instance|instance}", "{account[username]!l}"]
...
2022-11-02 17:09:38 +01:00
Mike Fährmann
2a1cb403ee
Revert "[Deviantart] [ #1776 ] Remove the "you need session cookies to download mature scraps" warning ( #1777 )"
...
This reverts commit 1f02878351 .
Mature scraps do yet again require cookies.
2022-11-01 21:23:05 +01:00
Mike Fährmann
86790da2d5
update Cloudflare IUAM detection
...
again
2022-10-31 18:33:52 +01:00
Mike Fährmann
775895f44b
[booru] refactor 'tags' and 'notes' extraction
...
- move HTML request for post pages into its own function
- move gelbooru_v02.py notes extraction to gelbooru.py
since it only works there
- clean up some code
2022-10-31 12:01:19 +01:00
Luc Ritchie
0f9dfb7e62
[instagram] Fix AttributeError on user stories extraction ( #3123 )
2022-10-30 11:47:06 +01:00
Mike Fährmann
f81dd5297a
[skeb] fix extraction ( #3112 )
...
'completed_at' is no longer included in API responses
2022-10-29 15:07:21 +02:00
enduser420
fb2dbb04e2
[moebooru] extract 'notes' ( #3094 )
2022-10-28 18:05:54 +02:00
Mike Fährmann
4e26bf98f5
[aibooru] support 'safe' subdomain ( #3110 )
2022-10-28 12:10:18 +02:00
Mike Fährmann
5c31791b3c
[mastodon] support '/web/' URLs ( #3109 )
2022-10-28 11:47:00 +02:00
Mike Fährmann
9a2cfd4421
[mastodon] support cross-instance user references ( #3109 )
2022-10-27 14:26:42 +02:00
Mike Fährmann
58d97188b4
[mastodon] add 'bookmark' extractor ( #3109 )
2022-10-26 21:28:50 +02:00
Mike Fährmann
46b64251eb
[bcy] fix extraction ( #3103 )
...
- fix regex for non-watermarked images
- fetch data from '/item/detail' pages for all other posts,
since '/apiv3/user/selfPosts' only has incomplete data
2022-10-26 15:33:07 +02:00
Mike Fährmann
77173694d5
[kemonoparty] fix 'dms' extraction ( #3106 )
2022-10-26 14:25:43 +02:00
Mike Fährmann
f168ec9572
[instagram] extract 'coauthors' metadata ( #3107 )
2022-10-26 14:11:29 +02:00
Mike Fährmann
7c6af27eb8
[tumblr] add 'fallback-*' options ( #2957 )
...
specifically 'fallback-delay' and 'fallback-retries'
and change default number of retries to 2 (down from 3)
2022-10-26 13:59:09 +02:00
Mike Fährmann
4aa56d500b
[hentaihere] fix test results
2022-10-24 16:56:27 +02:00
Mike Fährmann
75d707fd92
[hentaihere] update
...
- support minor versions in chapter URLs
- fix manga metadata extraction
- update tests
2022-10-24 16:41:22 +02:00
Mike Fährmann
d2fc73f20b
[hentai2read] fix manga metadata extraction
...
and update tests
2022-10-24 16:31:01 +02:00
Mike Fährmann
f4d06e5180
[manganelo] update domain to 'chapmanganato.com' ( #3097 )
2022-10-24 16:08:12 +02:00
Mike Fährmann
769e6754dc
[pixiv] use 'exact_match_for_tags' as default search mode ( #3092 )
2022-10-24 16:08:12 +02:00
Mike Fährmann
a90e5cb354
[instagram] support 'instagram.com/s/' highlight URLs ( #3076 )
2022-10-24 16:08:12 +02:00
enduser420
fd19c4b228
[hentai2read] recognize '.' in chapter ( #3089 )
2022-10-24 15:53:51 +02:00
enduser420
2ff1897421
[vichan] recognize board url w/o trailing slash ( #3087 )
2022-10-24 15:52:54 +02:00
enduser420
ac6111e693
[mangasee] add support for 'mangalife' ( #3086 )
2022-10-21 16:46:36 +02:00
KJ16609
300bc03deb
[gelbooru] allow alternate parameter order in post URLs ( #2821 )
2022-10-21 14:59:29 +02:00
Mike Fährmann
a7d23f1484
[vichan] add generic extractors for vichan imageboards
...
includes 8kun.top, smuglo.li, and wikieat.club
2022-10-21 14:40:45 +02:00
Mike Fährmann
04d3ebdfb4
[redgifs] fix 'token' extraction ( #3080 , #3081 )
2022-10-21 13:27:16 +02:00