Commit Graph

2876 Commits

Author SHA1 Message Date
Mike Fährmann
4e11ca737e [hentaifoundry] fix metadata extraction 2022-07-12 22:19:22 +02:00
Mike Fährmann
f2e59cc906 [slideshare] fix 'description' extraction 2022-07-12 18:38:44 +02:00
Mike Fährmann
31e868fca1 [khinsider] extract 'platform' metadata 2022-07-12 18:31:31 +02:00
Mike Fährmann
c6a9bab019 update extractor test results 2022-07-12 15:49:22 +02:00
Mike Fährmann
539e3bbed9 [weibo] handle invalid/broken status objects 2022-07-12 15:49:09 +02:00
Mike Fährmann
32c75d12e8 [sankaku] rewrite URLs to s.sankakucomplex.com (#2746) 2022-07-11 12:46:04 +02:00
Mike Fährmann
d5ded11aa8 [pixiv] fix default filenames for backgrounds 2022-07-11 12:45:38 +02:00
Mike Fährmann
e1f501ed14 [mangakakalot] update domain 2022-07-11 00:29:25 +02:00
Mike Fährmann
2dc57637cf [foolfuuka] remove archive.wakarimasen.moe 2022-07-10 23:13:49 +02:00
Mike Fährmann
98744977cf [itaku] fix 'date' parsing 2022-07-10 20:45:51 +02:00
Mike Fährmann
b590774f67 [twitter] add 'count' metadata field (#2741) 2022-07-10 14:37:04 +02:00
Mike Fährmann
7c0505868c [kemonoparty] ensure all files have an 'extension' (#2740) 2022-07-10 13:53:07 +02:00
Mike Fährmann
e4f48cc810 make it easier to disable default 'browser' settings
Previously it was necessary to set 'browser' to a non-empty, non-string
value to disable any default 'browser' value.
Now '-o browser=' or '-o browser=false' is enough.
2022-07-07 11:17:43 +02:00
Mike Fährmann
92b75bcdce limit path length for --write-pages output on Windows (#2733) 2022-07-06 18:56:23 +02:00
Mike Fährmann
311e9383af [pinterest] handle section pins with separate extractors (#2684) 2022-07-03 18:12:16 +02:00
Mike Fährmann
1d14928bd9 [twitter] ignore previously seen Tweets (#2712)
occurs primarily for /with_replies results when logged in
2022-07-03 16:13:53 +02:00
Mike Fährmann
4b2a0a0eda [twitter] implement 'strategy' option (#2712)
to be able to better control what Tweets get used and returned
for twitter.com/USER URLs.
2022-07-03 14:29:15 +02:00
Mike Fährmann
c794777600 [newgrounds] prevent exception on empty results (#2727) 2022-07-03 11:44:46 +02:00
Mike Fährmann
36ead45546 [itaku] fix caching bug (#1842)
ItakuApi.user() would always return the first user it was called with,
regardless of its 'username' argument.
2022-07-01 21:00:39 +02:00
Mike Fährmann
127a190c94 [itaku] categorize sections by group (#1842) 2022-06-29 22:34:07 +02:00
Mike Fährmann
de20cadc68 add 'brotli' as optional dependency (#2716)
only send 'Accept-Encoding: br' if supported
2022-06-29 15:10:05 +02:00
Mike Fährmann
37453a9528 [newgrounds] only login if necessary (#2715) 2022-06-29 11:46:07 +02:00
Mike Fährmann
7b073bf9ef Revert "[twitter] improve strategy for user URLs (#2665)"
'user_tweets_and_replies' was a mistake
2022-06-28 20:38:56 +02:00
Mike Fährmann
3a5d5c3a91 update default User-Agent header to Firefox 102 ESR
and update headers and ciphers for "browser": "firefox"
2022-06-28 17:38:58 +02:00
Mike Fährmann
f8cfc3b08a [skeb] add 'following' extractor (#2698) 2022-06-27 11:31:49 +02:00
Mike Fährmann
367a491128 [vk] get URLs from *_src entries (#2535)
https://github.com/mikf/gallery-dl/issues/2535#issuecomment-1166566986
2022-06-27 10:24:35 +02:00
Mike Fährmann
241e82e18d [horne] add support for horne.red (#2700) 2022-06-25 16:52:16 +02:00
Mike Fährmann
7af4d2047b [instagram] improve metadata generated by _parse_post_api()
(#2695)
2022-06-24 23:44:20 +02:00
Mike Fährmann
3f50e2fb5f [poipiku] add simple password support (#1602) 2022-06-22 18:21:01 +02:00
Mike Fährmann
9d8e99af80 [itaku] support videos (#1842) 2022-06-20 19:47:53 +02:00
Mike Fährmann
c8ec2c4e85 [itaku] add 'title' to default filenames (#1842) 2022-06-20 19:35:46 +02:00
Mike Fährmann
e0c60a1206 [itaku] metadata cleanup (#1842)
- parse 'date_added' as 'date'
- simplify 'tags', 'categorized_tags', and 'sections'
2022-06-20 19:28:15 +02:00
Mike Fährmann
27e8078fb7 [poipiku] add 'user' and 'post' extractors (#1602) 2022-06-20 11:32:02 +02:00
Mike Fährmann
fa902cd54d [itaku] add 'gallery' and 'image' extractors (#1842) 2022-06-20 11:31:44 +02:00
Mike Fährmann
d6c6c8a4a0 [twitter] improve '"replies": "self"' (#2665)
If a username is given in the input URL,
only download from replies by that user.
2022-06-13 19:21:32 +02:00
Mike Fährmann
9c8d895d19 [twitter] implement 'csrf' option (#2676) 2022-06-13 18:36:39 +02:00
Mike Fährmann
08db8435f1 [twitter] fix pagination for conversation tweets
a relic from the switch to GraphQL API
2022-06-13 16:27:30 +02:00
Mike Fährmann
78d83345d3 [cyberdrop] add fallback URLs (#2668) 2022-06-13 14:19:40 +02:00
Mike Fährmann
834e900037 [unsplash] add collection_title and …_id metadata fields (#2670) 2022-06-12 18:26:20 +02:00
Mike Fährmann
6db77d4656 [weibo] support '?tabtype=video' listings (#2601) 2022-06-12 17:55:23 +02:00
Mike Fährmann
1da3ccf608 [twitter] implement 'expand' option (#2665) 2022-06-12 17:26:51 +02:00
Mike Fährmann
0add1fc090 [twitter] improve strategy for user URLs (#2665)
- use '/with_replies' when appropriate
- consider 'text-tweets'
- build search query as necessary
2022-06-12 17:24:53 +02:00
Mike Fährmann
45c980daf0 [weibo] fix retweets (#2601) 2022-06-11 15:30:26 +02:00
Mike Fährmann
ae1b24aa6a [instagram] automatically invalidate expired login sessions 2022-06-10 22:43:11 +02:00
Mike Fährmann
47a92c8c7e [instagram] provide 'date' for 'carousel_media' files (#2660) 2022-06-07 15:11:49 +02:00
Mike Fährmann
2064f20e11 [instagram] fix 'tag' extractor (#2659) 2022-06-06 14:30:44 +02:00
Mike Fährmann
6c0fa2f258 [readcomiconline] update 2022-06-05 21:40:08 +02:00
Mike Fährmann
61cbf8318c [weibo] fix URLs generated by 'user' extractor (#2601) 2022-06-05 21:37:57 +02:00
Mike Fährmann
4b78bd423f [paheal] add 'metadata' option (#2641) 2022-06-04 16:05:49 +02:00
Mike Fährmann
535cbcb185 cache extracted browser cookies
(in memory, for as long as gallery-dl is running)

Extracting encrypted cookies from a chromium-based browser can take a
long time, so repeating this process for each extractor should be
avoided.

Same goes for creating a temporary copy of the entire cookie database.
2022-06-04 12:38:38 +02:00