Commit Graph

6351 Commits

Author SHA1 Message Date
Mike Fährmann
f15538938e [jpgfish] decrypt file URLs (#7073 #7079) 2025-03-02 15:17:07 +01:00
Mike Fährmann
262e8628fa [tiktok] improve 'tiktok-range' parsing (#7098)
replace values evaluating as False (except 0) with ""
2025-03-02 14:50:49 +01:00
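The rule in the commit message above ("replace values evaluating as False (except 0) with ''") can be sketched as follows. This is a minimal illustration, not the code from the commit: the helper name `normalize_range` is hypothetical, and treating `False` like the other falsy values is an assumption (in Python `False == 0`, so it has to be checked explicitly).

```python
def normalize_range(value):
    """Hypothetical helper: map falsy option values other than 0 to ""."""
    # 'False == 0' is true in Python, so test for False by identity first;
    # assumption: False should be replaced like None or an empty string.
    if value is False or (not value and value != 0):
        return ""
    # 0 and every truthy value pass through unchanged
    return value


for raw in (None, "", False, 0, "1-3"):
    print(repr(raw), "->", repr(normalize_range(raw)))
```

Keeping `0` untouched matters if the option distinguishes an explicit zero from "not set", which is what the "except 0" in the message suggests.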
Mike Fährmann
4ecc40ce4b [docs] fix 'tiktok-range' default value (#7098) 2025-03-02 14:34:30 +01:00
Mike Fährmann
db19990a82 [text] allow calling 'extract_iter' with invalid arguments 2025-03-02 10:44:06 +01:00
Mike Fährmann
472ea737d8 [erome] fix AttributeError for albums without tags (#7076)
fixes regression introduced in 07a9dca459
2025-03-01 21:18:57 +01:00
Mike Fährmann
71ebbfa577 release version 1.29.0 2025-03-01 20:08:19 +01:00
Mike Fährmann
1a6fa70e5e [archive] fix exception when specifying path as list (#7088) 2025-02-28 21:24:25 +01:00
Mike Fährmann
6c8c264cf5 [dl:ytdl] log ytdl module and version when importing 2025-02-28 21:18:42 +01:00
Mike Fährmann
7c33b9fbfe merge #7070: [bunkr] decrypt file URLs (#7058 #7085) 2025-02-28 15:33:46 +01:00
okbuddyagi
16c33527fa [bunkr] bypassed URL encryption
- fixed imports
- formatting
- revert unrelated formatting changes
- cleanup / simplify
2025-02-28 15:31:35 +01:00
Mike Fährmann
f86ff15146 merge #7067: [discord] add single message support (#7059) 2025-02-28 10:49:53 +01:00
Mike Fährmann
fa7114ee20 [docs] update supportedsites 2025-02-28 10:48:28 +01:00
Mike Fährmann
aeb72de575 [chevereto] fix extraction (#7073 #7079)
extract file URLs from an alternate location
if 'og:image' is not set
2025-02-28 10:34:00 +01:00
Mike Fährmann
8256a7a8e4 [sankaku] fix extraction (#7071 #7072)
omit 'Platform: web-app' API header to get sankaku to include
'file_url' data in API responses again
2025-02-28 10:28:29 +01:00
Mike Fährmann
07a9dca459 [erome] extract 'tags' metadata (#7076) 2025-02-28 10:15:05 +01:00
Luca Russo
95c1feab1c [discord] add single message support 2025-02-26 22:16:53 +01:00
Mike Fährmann
1d3b9a9e40 [tiktok] fix variable name
fixes regression introduced in 5e87aee32d
2025-02-26 21:53:32 +01:00
Mike Fährmann
4a3db15fc4 [discord] do not match message URLs as channels (#7059) 2025-02-26 21:11:59 +01:00
Mike Fährmann
afde4ad343 [tiktok] add 'avatar' option 2025-02-26 21:09:57 +01:00
Mike Fährmann
5e87aee32d [tiktok] add 'audio' option (#7060) 2025-02-26 21:02:33 +01:00
Mike Fährmann
13c3fa45f7 [docs] add 'tiktok' options (#7060) 2025-02-26 20:45:25 +01:00
Mike Fährmann
203c2e3492 [vipergirls] change default 'domain' to 'viper.click' (#4166)
https://github.com/mikf/gallery-dl/issues/4166#issuecomment-2684014628

and update general 'domain' handling
2025-02-26 10:41:51 +01:00
Mike Fährmann
d2cad599f7 [twitter] support 'grok' cards content (#7040) 2025-02-25 20:47:31 +01:00
Mike Fährmann
876169ded5 [furaffinity] use a 1s delay between requests by default (#7054) 2025-02-25 20:12:54 +01:00
Mike Fährmann
87e3cb17e6 [oauth] improve wording when opening URL 2025-02-25 20:12:54 +01:00
CasualYouTuber31
daac2c6e04 [tiktok] add support (#3061 #4177 #5646 #6878 #6708)
* Add TikTok photo support

#3061
#4177

* Address linting errors

* Fix more test failures

* Forgot to update category names in tests

* Looking into re issue

* Follow default yt-dlp output template

* Fix format string error on 3.5

* Support downloading videos and audio

Respond to comments
Improve archiving and file naming

* Forgot to update supportedsites.md

* Support user profiles

* Fix indentation

* Prevent matching with more than one TikTok extractor

* Fix TikTok regex

* Support TikTok profile avatars

* Fix supportedsites.md

* TikTok: Ignore no formats error

In my limited experience, this doesn't mean that gallery-dl can't download the photo post (but this could mean that you can't download the audio)

* Fix error reporting message

* TikTok: Support more URL formats

vt.tiktok.com
www.tiktok.com/t/

* TikTok: Only download avatar when extracting user profile

* TikTok: Document profile avatar limitation

* TikTok: Add support for www.tiktokv.com/share links

* Address Share -> Sharepost issue

* TikTok: Export post's creation date in JSON (ISO 8601)

* [tiktok] update

* [tiktok] update 'vmpost' handling

just perform a HEAD request and handle its response

* [tiktok] build URLs from post IDs

instead of reusing unchanged input URLs

* [tiktok] combine 'post' and 'sharepost' extractors

* [tiktok] update default filenames

put 'id' and 'num' first to ensure better file order

* [tiktok] improve ytdl usage

- speed up extraction by passing '"extract_flat": True'
- pass more user options and cookies
- pre-define 'TikTokUser' extractor usage

* [tiktok] Add _COOKIES entry to AUTH_MAP

* [tiktok] Always download user avatars

* [tiktok] Add more documentation to supportedsites.md

* [tiktok] Address review comments

---------

Co-authored-by: Mike Fährmann <mike_faehrmann@web.de>
2025-02-25 20:10:48 +01:00
Mike Fährmann
a9853cd273 merge #6781: [bilibili] add 'user-articles-favorite' extractor (#6725) 2025-02-23 18:19:51 +01:00
mmmpipi
e4cc3419c5 add bilibili User Articles FavList support
- fix whitespace
- fix extractor names
- Add favlist url user check
- apply changes
- add test
- update docs/supportedsites
2025-02-23 18:18:45 +01:00
Mike Fährmann
fe958ed5d9 merge #6768: [boosty] add 'direct-messages' extractor 2025-02-23 18:17:10 +01:00
Dominik Prange
ff5f6fe70f [boosty] added new direct message extractor
- formatting
- fixed linting formatting errors
- fixed E999 SyntaxError: invalid syntax
- fixed class naming
- fixed mandatory extractor.boosty.metadata as true requirement
- update
  - apply changes
  - add test
  - update docs/supportedsites
- improve 'dialog' pagination logic
2025-02-23 18:14:59 +01:00
Mike Fährmann
b1487df381 [scripts/pull-request] handle branch already existing 2025-02-23 18:12:14 +01:00
Mike Fährmann
613f05afa3 fix cmdline arguments not overriding extractor-downloader options 2025-02-22 17:40:27 +01:00
Mike Fährmann
18ed39c1cf implement 'downloader' options per extractor category
by setting options inside 'http' or 'ytdl' inside extractor options
or inside subcategory options

{
    "extractor": {
        "mastodon": {
            "http": {
                "rate": "10k"
            }
        },
        "mastodon.social": {
            "http": {
                "rate": "100k"
            }
        }
    },
    "downloader": {
        "rate": "100m"
    }
}

Sets download speed to
- 100k for mastodon.social URLs
-  10k for mastodon sites in general
- 100m for all other sites
2025-02-22 10:08:59 +01:00
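The lookup order implied by the JSON config in the commit message above (most specific extractor block first, then the general one, then the global 'downloader' section) can be sketched like this. It is a simplified illustration under stated assumptions, not gallery-dl's actual config code: the function `resolve_downloader_option` and the `categories` list are hypothetical, and the values are read literally from the JSON as written.

```python
CONFIG = {
    "extractor": {
        "mastodon": {"http": {"rate": "10k"}},
        "mastodon.social": {"http": {"rate": "100k"}},
    },
    "downloader": {"rate": "100m"},
}


def resolve_downloader_option(config, categories, scheme, key, default=None):
    """Hypothetical lookup: return the most specific value for 'key'.

    'categories' is ordered from most to least specific,
    e.g. ["mastodon.social", "mastodon"].
    """
    extractor_opts = config.get("extractor", {})
    for category in categories:
        # options nested under e.g. extractor -> mastodon.social -> http
        scheme_opts = extractor_opts.get(category, {}).get(scheme, {})
        if key in scheme_opts:
            return scheme_opts[key]
    # fall back to the global downloader section
    return config.get("downloader", {}).get(key, default)


print(resolve_downloader_option(CONFIG, ["mastodon.social", "mastodon"], "http", "rate"))
print(resolve_downloader_option(CONFIG, ["mastodon"], "http", "rate"))
print(resolve_downloader_option(CONFIG, ["pixiv"], "http", "rate"))
```

With the config as written, the three calls resolve to the "mastodon.social" block, the "mastodon" block, and the global "downloader" value respectively.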
Mike Fährmann
4906541f7d [generic] fix config lookups by subcategory
'subcategory' needs to be set before Extractor.__init__() runs
to be included in '_cfgpath'
2025-02-22 10:08:59 +01:00
Mike Fährmann
79dc04d87c [subscribestar] fix 'post' extractor (#6582)
https://github.com/mikf/gallery-dl/issues/6582#issuecomment-2675939669
2025-02-22 10:08:59 +01:00
Mike Fährmann
57937c3b68 [newgrounds] provide 'comment_html' metadata (#7038) 2025-02-22 10:08:58 +01:00
Mike Fährmann
196aa263c2 [imhentai] improve pagination duplicate filtering 2025-02-20 20:53:35 +01:00
Mike Fährmann
52d4e1a100 [imhentai] inherit from BaseExtractor
combine all imhentai-like sites into one module
2025-02-19 22:14:52 +01:00
Mike Fährmann
7a11d02e7a [reddit] restrict subreddit search results (#7025) 2025-02-19 20:05:48 +01:00
Mike Fährmann
d4c56b08d7 [hentaiera] add support (#3046 #6952 #7020) 2025-02-19 17:42:04 +01:00
Mike Fährmann
4396029d36 [furry34] add support (#1078 #7018) 2025-02-19 16:35:48 +01:00
Mike Fährmann
67937d33e3 [archive] fix NameError when SQLite database path doesn't exist
fixes regression introduced in 841bc9f6
2025-02-18 22:09:07 +01:00
Mike Fährmann
82493a6672 [hentairox] add support (#7003) 2025-02-18 21:45:30 +01:00
Luca Russo
95c446fcd1 [discord] add support (#6836)
* first commit

* add --

* skip video embeds

* fix typo

* removed ambiguity

* add category support

* code tweaks

* more reliable embed extraction

* handle 403 errors (testing done)

* added "parent_id" keyword

* added "parent", "parent_type" keywords

the extractor should be now ready to merge!

* removed unnecessary dict unpacking

* added empty text messages extraction

* added "channel_topic"

* even more metadata extraction

can now extract all embeds images & text, as well as server banners. also code is much better.

* added user avatar and banner

* better pagination

* fix regression

* minor tweaks

* Made requested changes
2025-02-18 18:45:39 +01:00
Mike Fährmann
fd4de02e67 [archive] support PostgreSQL archives for post processors (#6152) 2025-02-17 14:58:14 +01:00
Mike Fährmann
8daf496a22 [archive] add 'archive-table' option (#6152) 2025-02-17 11:41:13 +01:00
Mike Fährmann
dac0c4ac10 [docs] add 'psycopg' to optional dependencies 2025-02-17 10:59:15 +01:00
Mike Fährmann
841bc9f66f [archive] implement support for PostgreSQL databases (#6152) 2025-02-16 17:56:52 +01:00
Mike Fährmann
b4eae65965 [imhentai] avoid unnecessary HTTP request
no need to fetch a gallery's '/view/' page when the main page contains
all the same data as well
2025-02-16 15:04:24 +01:00
Mike Fährmann
800cf5beb5 replace 'print()' with 'output.stderr_write("\n")' 2025-02-15 18:01:05 +01:00