Commit Graph

4601 Commits

Author SHA1 Message Date
Mike Fährmann
d0184fddcf [twitter] optimize '_extract_twitpic()'
- use findall instead of finditer
- store URLs in a dict to discard duplicates
2023-05-25 15:18:49 +02:00
Mike Fährmann
3dc862c7fc merge #3796: [twitter] extract TwitPic URLs in text (#3792) 2023-05-25 14:59:07 +02:00
Mike Fährmann
243de697b9 merge #3976: [reddit] support cross-posted media (#887, #3586) 2023-05-25 13:16:25 +02:00
Mike Fährmann
f8c4c5eef9 [reddit] simplify and add tests 2023-05-25 13:15:11 +02:00
Mike Fährmann
f3cca50b9e [mangadex] update links to API docs 2023-05-24 12:32:13 +02:00
Mike Fährmann
65a9f4b124 merge #3950: [misskey] add 'favorite' extractor 2023-05-23 22:18:32 +02:00
Mike Fährmann
c76f0f3a1b [misskey] update
- rename to 'MisskeyFavoriteExtractor'
- add 'access-token' option to docs
- add test URLs for other instances
- simplify 'pattern'
2023-05-23 22:17:17 +02:00
Mike Fährmann
3fca455b82 [pixiv] add 'embeds' option (#1241) 2023-05-23 12:14:06 +02:00
Mike Fährmann
d1f2ef3b7b [imagechest] update
- don't load HTML page when using API
- restructure some code
- add more methods to ImagechestAPI
2023-05-22 22:42:16 +02:00
Mike Fährmann
856f6c10cd allow for GalleryExtractors to skip loading gallery_url 2023-05-22 22:29:30 +02:00
Mike Fährmann
9810ab35af [docs] update formatting.md
- add element access with quotes
- fix \f formatting
- remove note about typing \f in a shell (9e6c9813)
2023-05-22 22:24:23 +02:00
Mike Fährmann
4fc9675d48 [fanbox] skip 404ed or otherwise invalid posts (#4088) 2023-05-22 18:49:06 +02:00
Mike Fährmann
69865dcc05 [formatter] implement slicing strings as bytes (#4087)
prefixing a slice '[10:30]' with a lowercase b '[b10:30]' encodes
the string to bytes in filesystem encoding before applying the slice
2023-05-22 18:30:45 +02:00
Mike Fährmann
56b8b8cd36 [pixiv] support short novel URLs
https://www.pixiv.net/n/<ID>
2023-05-21 14:26:30 +02:00
Mike Fährmann
e6f55d1555 [imagechest] add API support and 'access-token' option (#4065) 2023-05-20 17:00:27 +02:00
Mike Fährmann
77abcf5ab3 [gofile] automatically fetch 'website-token' by default
the hardcoded token changed yet again
2023-05-20 17:00:27 +02:00
Mike Fährmann
e3fed9bd17 [tcbscans] update domain to 'tcbscans.com' (#4080) 2023-05-20 13:59:35 +02:00
Mike Fährmann
a83983c651 [instagram] add 'order-posts' option (#4017, #3993) 2023-05-18 22:50:12 +02:00
Mike Fährmann
d680623db3 [instagram] add 'order-files' option (#4017, #3993) 2023-05-18 22:34:33 +02:00
Naatie
f9b7a033e0 [misskey] refactor misskey extractor 2023-05-18 20:15:59 +07:00
Naatie
04dbfd994e [misskey] add my favorites extractor 2023-05-18 20:15:59 +07:00
Mike Fährmann
82a12d6126 [nsfwalbum] detect placeholder images
patch by an anonymous contributor
2023-05-17 15:18:10 +02:00
Mike Fährmann
011e4607c3 [poipiku] extract full 'descriptions' (#4066)
don't cut it off after the first line
2023-05-15 23:16:31 +02:00
Mike Fährmann
5037013e2b [gofile] update 'website-token' (#4056) 2023-05-15 14:59:44 +02:00
Mike Fährmann
6b6bb4be73 [weibo] require numeric IDs to have length >= 10 (#4059) 2023-05-14 18:45:37 +02:00
Mike Fährmann
56db930c47 don't include 'toml' package in PyInstaller binary
My own system now has Python 3.11 with native TOML support.
2023-05-14 18:43:37 +02:00
Mike Fährmann
494acabd38 [danbooru] refactor pagination logic (#4002)
- only use 'b<ID>' when no other order is specified
- support 'a<ID>' when  using 'order:id' as tag
2023-05-14 18:39:59 +02:00
Mike Fährmann
fd0e1ffd6e [danbooru] improve 75666cf9 (#4002)
Search for direct post IDs instead of trying to
replicate the same results as the initial request.
2023-05-14 16:35:31 +02:00
Mike Fährmann
e41e45ff6b [gofile] add basic password support (#4056) 2023-05-13 15:46:29 +02:00
Mike Fährmann
790dd365e1 [postprocessor:exec] support tilde expansion for 'command'
https://github.com/mikf/gallery-dl/issues/146#issuecomment-1544733532
2023-05-12 16:34:17 +02:00
Mike Fährmann
2e6cea95db [cookies] update logging behavior (#4050)
- only show the same warning/error once
- simplify and capitalize logging messages
2023-05-12 16:34:17 +02:00
Mike Fährmann
20dc13f832 [pixiv] initial 'novel' support (#1241, #4044)
supported URLs are
- https://www.pixiv.net/novel/show.php?id=<ID>
- https://www.pixiv.net/novel/series/<ID>
- https://www.pixiv.net/en/users/<ID>/novels
2023-05-12 16:34:08 +02:00
Mike Fährmann
c698c3de44 [newgrounds] add default delay between requests (#4046) 2023-05-11 16:04:37 +02:00
Mike Fährmann
708f478d15 [danbooru][e621] add 'date' metadata field (#4047) 2023-05-11 15:59:42 +02:00
Mike Fährmann
306e13a4d4 release version 1.25.4 2023-05-07 20:57:36 +02:00
Mike Fährmann
35c23a2fd8 merge #4031: [mangadex] add 'status' and 'tags' metadata 2023-05-07 20:16:23 +02:00
Mike Fährmann
2266fc8cc5 [mangadex] update and extend test results 2023-05-07 20:14:49 +02:00
Janne Alaranta
1ce5dc9e18 fix whitespaces 2023-05-07 18:47:04 +03:00
Janne Alaranta
13dedae09f add status and tags info to mangadex extractor 2023-05-07 18:35:02 +03:00
Mike Fährmann
be0fa94b2e [imagechest] load all images when a 'Load More' button is present
(#4028)
2023-05-07 14:46:48 +02:00
Mike Fährmann
7eadcbea70 [4chanarchives] add end condition for 'board' extractor (#4012) 2023-05-06 20:52:45 +02:00
Mike Fährmann
1406f7125f [4chanarchives] add 'thread' and 'board' extractors (#4012) 2023-05-06 20:45:57 +02:00
Mike Fährmann
285391df43 add '-C' as short option for '--cookies'
and put cookie options into their own section
2023-05-06 20:45:44 +02:00
Mike Fährmann
b9b1cdd71b add '--cookies-export' command-line option 2023-05-06 17:37:49 +02:00
Mike Fährmann
d12dd3813c [imgur] fix internal image/album URLs
URLs from "link" attributes of newer images/albums were all returned
as 'https://imgur.com/gallery/...' instead of the expected format,
causing them to be ignored.
2023-05-06 15:13:38 +02:00
Mike Fährmann
8520de57f0 [imgur] add 'favorite-folder' extractor (#4016) 2023-05-06 15:10:13 +02:00
Mike Fährmann
4c1f3b2160 [cookies] simplify '_mac_absolute_time_to_posix()'
hardcode UNIX timestamp of 2001-01-01
2023-05-06 15:10:13 +02:00
Mike Fährmann
a14b63d941 support selecting a domain for '--cookies-from-browser'
for example 'gallery-dl --cookies-from-browser firefox/twitter.com'
2023-05-06 15:09:49 +02:00
Mike Fährmann
3ca5dac8b6 extend 'cookies-update' functionality
Allow writing cookies to a different file than a given cookies.txt,
making it possible to export cookies imported with --cookies-from-browser

To convert browser cookies to cookies.txt format:
  gallery-dl --cookies-fr chromium \
             -o cookies-update=cookies.txt \
             --no-download \
             http://example.org/file.jpg
2023-05-04 15:10:47 +02:00
Mike Fährmann
bc6d65d203 implement 'Extractor.config_deprecated()'
a version of 'Extractor.config()'
that logs a warning when using a deprecated option name
2023-05-04 10:49:14 +02:00