ClosedPort22
ce8dbb1ccc
[twitter] fix crash when using 'expand' and 'syndication'
...
caused by KeyError: 'conversation_id_str'
2022-12-30 12:45:44 +08:00
ClosedPort22
38786a9593
[twitter] refactor extraction of TwitPic URLs
...
flattening
2022-12-27 12:23:12 +08:00
enduser420
527bb2c4ab
[lynxchan/bbw-chan] add 'thread' and 'board' extractors
2022-12-26 20:59:42 +05:30
blankie
f82ee93676
[danbooru] extract uploader metadata (if metadata is set)
2022-12-25 11:00:59 +07:00
ClosedPort22
250d35107c
[twitter] prioritize tweet type checks (#3439)
...
Do not consider a tweet seen before applying 'retweet', 'quote' and
'reply' checks. Otherwise the original tweets will also be skipped if
the "derivative" tweets and the original tweets are from the same user.
2022-12-24 12:50:52 +08:00
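The ordering issue described in the commit above can be illustrated with a minimal sketch. It is not the extractor's actual code: the function name `select_tweets`, the `retweets` parameter, and the simplified tweet dicts (an `id` shared by a retweet and its original, plus an `is_retweet` flag) are assumptions made for illustration only.

```python
def select_tweets(tweets, retweets=False):
    """Yield tweets, applying the type check before the 'seen' check.

    Hypothetical data model: a retweet carries the ID of the tweet it
    derives from, so both entries share the same "id".
    """
    seen = set()
    for tweet in tweets:
        # Type check first: a skipped retweet must not mark the
        # shared ID as seen, or the original would be dropped too.
        if tweet["is_retweet"] and not retweets:
            continue
        if tweet["id"] in seen:
            continue
        seen.add(tweet["id"])
        yield tweet


# A user retweets their own tweet; the retweet shows up first.
timeline = [
    {"id": 1, "is_retweet": True},
    {"id": 1, "is_retweet": False},
    {"id": 2, "is_retweet": False},
]
result = list(select_tweets(timeline))
print([(t["id"], t["is_retweet"]) for t in result])
# [(1, False), (2, False)]
```

If the `seen` check ran first, the retweet would register ID 1 before being filtered out, and the original tweet would then be skipped as a duplicate.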
ClosedPort22
3eb352fcb0
[twitter] force HTTPS for TwitPic URLs
2022-12-23 18:16:34 +08:00
lx30011
895b41f1ac
[jschan] add generic jschan extractor
2022-12-23 00:32:52 +01:00
Mike Fährmann
bee354c264
Merge pull request #3415 from enduser420/extractor/fapello
...
[fapello] add 'post', 'user' and 'path' extractors
2022-12-17 20:05:46 +01:00
Mike Fährmann
8d7585534e
Merge pull request #3367 from the-blank-x/deviantart-view
...
[deviantart] add /view URL support
2022-12-17 20:05:35 +01:00
blankie
6614d94b08
[deviantart] add /view URL support
2022-12-17 23:27:55 +07:00
Mike Fährmann
dd6eeb4336
Merge pull request #3366 from ClosedPort22/da-extra-stash
...
[deviantart] extract sta.sh URLs from `text_content`
2022-12-17 15:42:12 +01:00
Mike Fährmann
f36cbb3911
Merge pull request #3413 from ClosedPort22/e621-manual-pagination
...
[e621] implement manual pagination
2022-12-17 15:42:00 +01:00
ClosedPort22
dd4a4a3fa6
[e621] softcode the pagination threshold
2022-12-17 21:06:47 +08:00
ClosedPort22
9faa4ed738
[e621] refactor pagination control
...
as suggested by @mikf
2022-12-17 20:05:34 +08:00
Mike Fährmann
7851a2c520
[seiga] raise error when redirected to login page (#3401)
2022-12-17 12:52:17 +01:00
Mike Fährmann
68ce5f965d
[instagram] remove unused code
2022-12-17 12:52:17 +01:00
Mike Fährmann
4063563cd7
[zerochan] update for layout v3
...
- remove cookie disabling v3
- fix and improve metadata extraction
2022-12-17 12:51:51 +01:00
Mike Fährmann
1e6407ca98
Merge pull request #3414 from pubak42/master
...
[sex.com] Download videos from cdn (#3408)
2022-12-17 12:30:34 +01:00
ClosedPort22
bf1649dadb
[imgur] add support for imgur.io URLs
2022-12-17 14:33:44 +08:00
enduser420
7e08e2d982
[fapello] set 'filename_fmt'
2022-12-16 20:57:39 +05:30
enduser420
e5076ba056
[fapello] add 'post', 'user' and 'path' extractors
2022-12-16 16:53:32 +05:30
pubak42
e7326cdf1d
[sex.com] Download videos from cdn (#3408)
...
The format of video sources recently changed to a full URL beginning with https://.
The original extractor code appended the video source URL to the root URL of the website, yielding
an invalid URL of the form ...sex.comhttps... that failed to resolve.
2022-12-16 10:16:40 +01:00
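A small sketch of the failure described above and one way to avoid it. The commit's actual fix is not shown here; this example uses the standard library's `urljoin`, which leaves absolute URLs untouched and resolves relative ones against a base. The `ROOT` value, the `resolve_source` helper, and the example CDN host are hypothetical.

```python
from urllib.parse import urljoin

ROOT = "https://www.sex.com"  # assumed root URL, for illustration only


def resolve_source(root, src):
    """Return an absolute URL for 'src', whether it is relative or absolute."""
    return urljoin(root, src)


# Hypothetical video sources: one relative path, one full URL.
relative_src = "/videos/clip.mp4"
absolute_src = "https://cdn.example.org/videos/clip.mp4"

# Plain concatenation breaks once the source is already a full URL.
broken = ROOT + absolute_src
print(broken)
# https://www.sex.comhttps://cdn.example.org/videos/clip.mp4

print(resolve_source(ROOT, relative_src))
# https://www.sex.com/videos/clip.mp4
print(resolve_source(ROOT, absolute_src))
# https://cdn.example.org/videos/clip.mp4
```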
ClosedPort22
d0ad6d0e67
[e621] implement manual pagination mode
2022-12-16 16:15:36 +08:00
Mike Fährmann
6f0735568c
[2chen] fix file URLs
2022-12-15 18:05:32 +01:00
enduser420
a2be06d873
[2chen] add '.club' support (#3406)
2022-12-15 17:51:02 +01:00
Mike Fährmann
a6d4733e11
[pixiv] extract 'date_url' metadata (#3405)
...
i.e. the datetime encoded in each file URL.
https://i.pximg.net/img-master/img/2022/12/01/13/44/55/12345678_p0.jpg
->
2022-12-01 13:44:55 +09:00
->
2022-12-01 04:44:55
2022-12-15 11:40:20 +01:00
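The conversion shown in the commit above (URL path, then a +09:00 timestamp, then UTC) can be sketched as follows. This is an illustration, not the extractor's code: the function name `date_from_url` and the regular expression are assumptions, and the only input used is the example URL from the commit message.

```python
import re
from datetime import datetime, timedelta, timezone

# The commit message shows the URL timestamp at +09:00.
JST = timezone(timedelta(hours=9))


def date_from_url(url):
    """Return the datetime encoded in a file URL as naive UTC, or None."""
    match = re.search(
        r"/img/(\d{4})/(\d{2})/(\d{2})/(\d{2})/(\d{2})/(\d{2})/", url)
    if match is None:
        return None
    # Build an aware datetime at +09:00, then convert it to UTC.
    local = datetime(*map(int, match.groups()), tzinfo=JST)
    return local.astimezone(timezone.utc).replace(tzinfo=None)


url = "https://i.pximg.net/img-master/img/2022/12/01/13/44/55/12345678_p0.jpg"
print(date_from_url(url))
# 2022-12-01 04:44:55
```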
Mike Fährmann
1317625ec4
[webmshare] add 'video' extractor (#2410)
2022-12-14 19:59:07 +01:00
Mike Fährmann
90a9c0790f
[twitter] update 'search' pagination (#544)
...
Only stop when list of all returned Tweets is empty
instead of when no valid Tweet was found.
2022-12-14 19:56:59 +01:00
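The stop condition described in the commit above can be sketched roughly like this. The names `paginate`, `fetch_page`, and `is_valid`, as well as the integer cursors and string "tweets", are hypothetical stand-ins; the real API client is not modeled.

```python
def paginate(fetch_page, is_valid):
    """Yield valid tweets until a page returns no tweets at all."""
    cursor = None
    while True:
        tweets, cursor = fetch_page(cursor)
        # Stop only when the raw result list is empty. A page that
        # contains tweets, none of them valid, does not end the search.
        if not tweets:
            return
        for tweet in tweets:
            if is_valid(tweet):
                yield tweet


# Hypothetical pages: (tweets, next cursor). The second page has
# no valid entry but must not terminate pagination.
pages = [
    (["a", "b"], 1),
    (["promo"], 2),
    (["c"], 3),
    ([], None),
]


def fetch_page(cursor):
    return pages[cursor or 0]


results = list(paginate(fetch_page, lambda tweet: tweet != "promo"))
print(results)
# ['a', 'b', 'c']
```

Stopping as soon as a page yields no valid tweet would have returned only `['a', 'b']` here.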
Mike Fährmann
1cbc234819
[mangafox] extract more metadata (#3167)
2022-12-14 19:56:59 +01:00
Mike Fährmann
3082544fff
misc fixes
...
- fix typo (#3399)
- remove double assignment
- [bunkr] update things I forgot in 6b6f886d
- [soundgasm] adjust 'archive_fmt' (#3388)
2022-12-14 13:30:27 +01:00
enduser420
41bf236d36
[lynxchan] add generic extractors for lynxchan imageboards (#3394)
...
* [lynxchan] add generic extractors for lynxchan imageboards
includes kohlchan.net, endchan.org
* [lynxchan] set pop default to empty tuple
* Apply suggestions from code review
Co-authored-by: Mike Fährmann <mike_faehrmann@web.de>
2022-12-12 13:13:39 +01:00
Mike Fährmann
3c75c3bbc4
[soundgasm] add 'user' extractor (#3384)
...
based on code from PR #3388 by @enduser420
2022-12-12 12:23:59 +01:00
Mike Fährmann
2952add4a8
[reddit] increase 'id-max' default value (#3397)
...
to float("inf")
2022-12-12 12:10:06 +01:00
Mike Fährmann
a001c9c06f
[instagram] prevent post 'date' overwriting file 'date' (#3392)
2022-12-12 11:38:48 +01:00
Mike Fährmann
6b6f886dcf
[bunkr] update domain (#3391)
...
and improve bunkr/app.bunkr handling
2022-12-11 23:18:01 +01:00
ClosedPort22
bf3fd5951a
Merge branch 'master' into da-extra-stash
2022-12-10 21:34:37 +08:00
Mike Fährmann
eb94568e1f
[soundgasm] add 'audio' extractor (#3384)
2022-12-09 23:19:07 +01:00
Mike Fährmann
cd931e1139
update extractor test results
2022-12-08 18:58:29 +01:00
Mike Fährmann
989ec9fc79
[khinsider] fix metadata extraction
2022-12-07 21:36:34 +01:00
Mike Fährmann
1c25cc7a3e
[warosu] fix and update
2022-12-07 21:23:45 +01:00
Mike Fährmann
79e52f3539
[imgth] rewrite
...
- inherit from GalleryExtractor
- fix image URLs
- better metadata
2022-12-07 21:19:06 +01:00
Mike Fährmann
202c1210d5
[exhentai] fix pagination
2022-12-06 22:39:13 +01:00
Mike Fährmann
4a3a1f4c87
[komikcast] update domain and fix extraction
2022-12-06 22:00:23 +01:00
ClosedPort22
13d825731e
[deviantart] fix test for sta.sh URL extraction
...
Without the 'count' assertion, the test would be essentially useless.
2022-12-07 00:32:03 +08:00
ClosedPort22
6356c9be96
[deviantart] extract sta.sh URLs from 'text_content'
2022-12-06 22:57:53 +08:00
Mike Fährmann
5f57a27ba6
[imagetwist] fix extraction
2022-12-05 22:15:59 +01:00
Mike Fährmann
a42ba25ca1
[foolslide] remove 'kireicake'
...
site redirects to (unclaimed) mangadex group
2022-12-05 19:47:56 +01:00
Mike Fährmann
86f0597c95
[kissgoddess] remove module
...
site does not host albums anymore
2022-12-05 19:28:50 +01:00
Mike Fährmann
20e12b5d7c
[nitter] support '/i/user/' URLs (#3310)
...
as well as using 'id:<userid>' as username
not all nitter instances seem to support '/i/user/' ...
2022-12-04 16:21:13 +01:00
Mike Fährmann
fceaee3c4f
[lolisafe] remove zz.ht
2022-12-04 16:21:13 +01:00