Mike Fährmann
9695c4e88d
emit debug logging message when loading cookies from file
...
attempt nr. 2
no idea how I managed to remove 6514828d in a918ce29
2023-01-06 11:13:44 +01:00
Mike Fährmann
30a31836e7
merge #3449 : [twitter] force HTTPS for TwitPic URLs
2023-01-05 14:57:03 +01:00
Mike Fährmann
e18482e9ae
[twitter] improve 'http' -> 'https' replacement
2023-01-05 14:55:55 +01:00
Mike Fährmann
4fd6da474f
merge #3473 : [twitter] fix crash when using 'expand' and 'syndication'
2023-01-05 14:19:47 +01:00
Mike Fährmann
a918ce29b5
run tests on ubuntu-20.04
...
and remove Python 3.4, since that's no longer available
on this test runner
2023-01-05 13:33:27 +01:00
Mike Fährmann
6514828d4e
emit debug logging message when loading cookies from file
2023-01-05 12:40:22 +01:00
Mike Fährmann
3a238fd490
[poipiku] warn about login requirements
2023-01-05 12:29:31 +01:00
Mike Fährmann
f29ba089ff
merge #3474 : [fanleaks] add 'post' and 'model' extractors
2023-01-04 17:39:17 +01:00
Mike Fährmann
6933727b45
merge #3483 : [twitter] implement 'syndication=extended'
2023-01-04 17:36:17 +01:00
Mike Fährmann
07ed3a1fbf
merge #3460 : [poipiku] fix extraction for a different warning button style
...
(#3493 , #3492 )
2023-01-04 17:29:27 +01:00
Mike Fährmann
9116398c1c
[pinterest] add 'domain' option ( #3484 )
...
use input URL domain by default
2023-01-04 17:20:14 +01:00
blankie
2f985bcddb
[poipiku] fix extraction for a different warning button style
2023-01-04 08:28:31 +07:00
Mike Fährmann
294108c90a
[pinterest] support 'All Pins' boards ( #2855 , #3484 )
2023-01-03 19:11:20 +01:00
Mike Fährmann
77df8d3116
[deviantart] implement username&password login for scraps ( #1029 )
...
re-login when getting prematurely logged out by dA
is missing at the moment
2023-01-03 18:42:28 +01:00
Mike Fährmann
ed2d715019
fix 'keywords' in extractor tests ( #3491 )
2023-01-03 15:14:23 +01:00
ClosedPort22
6853b14be3
[twitter] apply suggestions from code review
...
Co-authored-by: Mike Fährmann <mike_faehrmann@web.de >
2023-01-02 21:03:01 +08:00
Mike Fährmann
4611237f8c
merge #3457 : [danbooru] extract uploader metadata (if option is set)
2023-01-01 21:20:39 +01:00
Mike Fährmann
e7522482bb
merge #3463 : [lynxchan] support 'bbw-chan.nl'
2023-01-01 15:50:09 +01:00
Mike Fährmann
7d6c846176
[fanbox] return 'imageMap' files in order ( #2718 )
2023-01-01 15:12:05 +01:00
Mike Fährmann
dc8e7ff54e
[bunkr] fix URLs returned by API ( #3481 )
2023-01-01 14:07:15 +01:00
enduser420
5fedef3a1a
[fanleaks] update 'model' URL pattern
2022-12-30 19:29:55 +05:30
enduser420
5a740ef78b
[fanleaks] add 'post' and 'model' extractors
2022-12-30 19:24:05 +05:30
ClosedPort22
7c8eab8d52
[twitter] implement 'syndication=extended'
...
to be able to fetch extended user metadata
2022-12-30 20:48:41 +08:00
ClosedPort22
be3286206a
[twitter] assume 'conversation_id' when using syndication
...
not possible to expand replies at the momemt
2022-12-30 13:57:37 +08:00
ClosedPort22
ce8dbb1ccc
[twitter] fix crash when using 'expand' and 'syndication'
...
caused by KeyError: 'conversation_id_str'
2022-12-30 12:45:44 +08:00
ClosedPort22
38786a9593
[twitter] refactor extraction of TwitPic URLs
...
flattening
2022-12-27 12:23:12 +08:00
enduser420
527bb2c4ab
[lynxchan/bbw-chan] add 'thread' and 'board' extractors
2022-12-26 20:59:42 +05:30
blankie
f82ee93676
[danbooru] extract uploader metadata (if metadata is set)
2022-12-25 11:00:59 +07:00
ClosedPort22
250d35107c
[twitter] prioritize tweet type checks ( #3439 )
...
Do not consider a tweet seen before applying 'retweet', 'quote' and
'reply' checks. Otherwise the original tweets will also be skipped if
the "derivative" tweets and the original tweets are from the same user.
2022-12-24 12:50:52 +08:00
ClosedPort22
3eb352fcb0
[twitter] force HTTPS for TwitPic URLs
2022-12-23 18:16:34 +08:00
Mike Fährmann
bee354c264
Merge pull request #3415 from enduser420/extractor/fapello
...
[fapello] add 'post', 'user' and 'path' extractors
2022-12-17 20:05:46 +01:00
Mike Fährmann
8d7585534e
Merge pull request #3367 from the-blank-x/deviantart-view
...
[deviantart] add /view URL support
2022-12-17 20:05:35 +01:00
blankie
6614d94b08
[deviantart] add /view URL support
2022-12-17 23:27:55 +07:00
Mike Fährmann
dd6eeb4336
Merge pull request #3366 from ClosedPort22/da-extra-stash
...
[deviantart] extract sta.sh URLs from `text_content`
2022-12-17 15:42:12 +01:00
Mike Fährmann
f36cbb3911
Merge pull request #3413 from ClosedPort22/e621-manual-pagination
...
[e621] implement manual pagination
2022-12-17 15:42:00 +01:00
ClosedPort22
dd4a4a3fa6
[e621] softcode the pagination threshold
2022-12-17 21:06:47 +08:00
ClosedPort22
9faa4ed738
[e621] refactor pagination control
...
as suggested by @mikf
2022-12-17 20:05:34 +08:00
Mike Fährmann
7851a2c520
[seiga] raise error when redirected to login page ( #3401 )
2022-12-17 12:52:17 +01:00
Mike Fährmann
68ce5f965d
[instagram] remove unused code
2022-12-17 12:52:17 +01:00
Mike Fährmann
4063563cd7
[zerochan] update for layout v3
...
- remove cookie disabling v3
- fix and improve metadata extraction
2022-12-17 12:51:51 +01:00
Mike Fährmann
1e6407ca98
Merge pull request #3414 from pubak42/master
...
[sex.com] Download videos from cdn (#3408 )
2022-12-17 12:30:34 +01:00
ClosedPort22
bf1649dadb
[imgur] add support for imgur.io URLs
2022-12-17 14:33:44 +08:00
enduser420
7e08e2d982
[fapello] set 'filename_fmt'
2022-12-16 20:57:39 +05:30
enduser420
e5076ba056
[fapello] add 'post', 'user' and 'path' extractors
2022-12-16 16:53:32 +05:30
pubak42
e7326cdf1d
[sex.com] Download videos from cdn ( #3408 )
...
The format of video sources was changed recently to be a full URL with https:// in the beginning.
The original extractor code appended the video source URL to root url of the website, thus yielding
invalid url in format ...sex.comhttps... that failed to resolve.
2022-12-16 10:16:40 +01:00
ClosedPort22
d0ad6d0e67
[e621] implement manual pagination mode
2022-12-16 16:15:36 +08:00
Mike Fährmann
6f0735568c
[2chen] fix file URLs
2022-12-15 18:05:32 +01:00
enduser420
a2be06d873
[2chen] add '.club' support ( #3406 )
2022-12-15 17:51:02 +01:00
Mike Fährmann
a6d4733e11
[pixiv] extract 'date_url' metadata ( #3405 )
...
i.e. the datetime encoded in each file URL.
https://i.pximg.net/img-master/img/2022/12/01/13/44/55/12345678_p0.jpg
->
2022-12-01 13:44:55 +09:00
->
2022-12-01 04:44:55
2022-12-15 11:40:20 +01:00
Mike Fährmann
1317625ec4
[webmshare] add 'video' extractor ( #2410 )
2022-12-14 19:59:07 +01:00