Commit Graph

3089 Commits

Author SHA1 Message Date
Mike Fährmann
532ac79fb0 update extractor test results 2021-05-21 02:28:53 +02:00
Mike Fährmann
d7bc4a2b8b [500px] update query hashes 2021-05-21 01:20:31 +02:00
Mike Fährmann
0f35aca728 [aryion] minor code updates 2021-05-19 23:46:33 +02:00
Mike Fährmann
2eb46452ad [aryion] update 'needle' to not skip text posts (fixes #1568)
on "Latest Updates" pages

"class='thumb scrollthumb' href='/g4/view/" and
"class='thumb' href='/g4/view/" both end with
"thumb' href='/g4/view/"
2021-05-19 23:35:05 +02:00
Mike Fährmann
adf4d661b3 use '_extractor' info in UrlJobs 2021-05-19 15:52:30 +02:00
Mike Fährmann
4fc9668922 [imgur] update URL patterns (#1561) 2021-05-19 15:44:10 +02:00
Mike Fährmann
1eabfa5c7a [pillowfort] implement login with username & password (#846) 2021-05-19 02:59:16 +02:00
Mike Fährmann
24dd10ac3c [patreon] extract user defined 'tags' (#1539, closes #1540) 2021-05-18 00:35:52 +02:00
Mike Fährmann
a7e4917ee1 [pillowfort] add 'inline' option (#846)
to support images present in a post's 'content',
but not listed in 'media'.

also separates the file hash present at the beginning
of each 'filename' into its own field.
2021-05-17 03:03:58 +02:00
Mike Fährmann
efa6cc8ec3 [pillowfort] add 'external' option (#846)
for links to external Twitter posts etc.
2021-05-17 01:46:42 +02:00
Mike Fährmann
394fbb5f56 [twitter] strip useless t.co links (#1532)
The 'full_text' of Tweets with media content usually ends with a t.co
link to itself. This commit removes those.
2021-05-17 00:20:29 +02:00
Mike Fährmann
3a7c3ff138 support XDG_CONFIG_HOME (closes #1545)
This will only load either
${XDG_CONFIG_HOME}/gallery-dl/config.json or
${HOME}/.config/gallery-dl/config.json
if XDG_CONFIG_HOME is empty, never both.
2021-05-16 02:48:16 +02:00
Mike Fährmann
41457dbb1b [twitter] resolve t.co URLs in 'content' (#1532) 2021-05-15 18:52:37 +02:00
Mike Fährmann
2b5d80862e [kemonoparty] add 'type' metadata field (#1556)
'file', 'attachment', or 'inline'
2021-05-15 01:13:41 +02:00
Mike Fährmann
17b0ccb071 [twitter] add missing retweet media entities (fixes #1555)
from the original tweets
2021-05-14 22:51:01 +02:00
Mike Fährmann
5eeaaee01d [pixiv] add 'metadata' option (#1551) 2021-05-14 20:30:28 +02:00
Mike Fährmann
0717456b4e [kemonoparty] add 'metadata' option (closes #1548)
to fetch creator names with an additional HTTP request
2021-05-14 19:56:49 +02:00
Mike Fährmann
559462789d add some tests for job.py 2021-05-14 19:44:16 +02:00
Mike Fährmann
b50b8e6cf4 refactor applying 'parent-…' options 2021-05-13 21:56:34 +02:00
Mike Fährmann
7ab8374385 add 'parent-skip' option (#1399) 2021-05-13 16:40:04 +02:00
Mike Fährmann
c693db5b1a add '"skip": "terminate"' option
Stops not only the current extractor/job,
but all parent extractors/jobs as well.
2021-05-12 02:22:28 +02:00
Mike Fährmann
4835888acc release version 1.17.4 2021-05-07 21:37:48 +02:00
Mike Fährmann
36ed1efcfb [pixiv] rename "noop" value for 'tags' option to "original"
(#1507)
2021-05-07 20:41:54 +02:00
Mike Fährmann
4e1d33cea3 update default gallery-dl.conf
add (empty) entries for
- deviantart.client-id and deviantart.client-secret
- pixiv.refresh-token
2021-05-07 16:17:36 +02:00
Mike Fährmann
14f983eab6 [deviantart] use default ID when 'client-id' is None 2021-05-07 16:14:38 +02:00
Mike Fährmann
3e4ffb0821 [gelbooru] add extractor for '/redirect.php' URLs (#1530) 2021-05-07 15:34:53 +02:00
Mike Fährmann
5e54105ae4 [instagram] update query hashes 2021-05-06 19:15:18 +02:00
Mike Fährmann
b3ee10a7fb [500px] update query hashes 2021-05-06 17:28:26 +02:00
Mike Fährmann
15b0241bbc [imagebam] fix extraction 2021-05-06 16:47:36 +02:00
Mike Fährmann
38ae61edd4 [inkbunny] add 'favorite' extractor (#1521) 2021-05-04 19:28:48 +02:00
Mike Fährmann
577fffad5f [nozomi] update 'archive_fmt' values for tag and search extractors
… so they actually work for posts with more than 1 file.

(fixes #1523)
2021-05-04 19:28:37 +02:00
Mike Fährmann
e300da1424 add 'output.skip' option 2021-05-04 19:27:18 +02:00
Mike Fährmann
c5ca7905ce add 'noop()' and 'identity()' functions 2021-05-04 19:27:17 +02:00
Mike Fährmann
755164b36a improve --clear-cache (#1230)
Allow for an optional argument to only delete cached entries from
a specific module.

delete all cache entries
$ gallery-dl --clear-cache
or
$ gallery-dl --clear-cache all

only delete entries for instagram
$ gallery-dl --clear-cache instagram
2021-05-04 19:27:17 +02:00
HRXN
e13cae182b [nozomi] Extend default archive-fmt for Tag and Search Extractor (#1529)
Closes #1523
2021-05-04 19:26:35 +02:00
Mike Fährmann
bc868e7bb8 consider apparently long extensions as part of the filename
(#1516)
2021-05-02 21:15:50 +02:00
Mike Fährmann
2133f1d77f [readcomiconline] change domain to 'readcomiconline.li'
(closes #1517)
2021-05-01 16:41:16 +02:00
Mike Fährmann
66f28e471c [kemonoparty] update file URLs directly linking to kemono.party
(#1514)
2021-05-01 02:30:10 +02:00
Mike Fährmann
6fa20d456b [sankaku] update invalid-token detection (fixes #1515) 2021-04-30 22:04:45 +02:00
Mike Fährmann
4b65ebf652 [kemonoparty] fix file URLs (#1514)
files are now hosted on https://data.kemono.party/
2021-04-29 19:36:34 +02:00
Mike Fährmann
fa519f9202 [pixiv] change 'translated-tags' option (#1507)
- rename to 'tags'
- use string-values: "japanese", "translated", "noop"
- remove duplicate entries for "translated" tags
2021-04-29 19:30:43 +02:00
Mike Fährmann
5b4da4b4bf reorder config access in Job constructor
(#1111)
2021-04-27 15:12:59 +02:00
Mike Fährmann
221015e586 [downloader:http] disable filename extension changes for ugoira
(#1507)
2021-04-27 01:29:09 +02:00
Mike Fährmann
e5123f56c9 fix crash when using --no-download with --ugoira-conv (#1507) 2021-04-26 23:35:44 +02:00
Mike Fährmann
07b6661a87 release version 1.17.3 2021-04-25 21:23:26 +02:00
Mike Fährmann
c6c4a73f87 update fanbox entry in supportedsites.md 2021-04-25 19:44:19 +02:00
thatfuckingbird
e47952ac14 add extractors for fantia and fanbox (#1459)
* add extractors for fantia and fanbox

* appease linter

* make docstrings unique

* [fantia] refactor post extraction

* [fantia] capitalize

* [fantia] improve regex pattern

* code style

* capitalize

* [fanbox] use BASE_PATTERN for url regexes

* [fanbox] refactor metadata and post extraction

* [fanbox] improve url base pattern

* [fanbox] accept creator page links ending with /posts

* [fanbox] more tests

* [fantia] improved pagination

* [fanbox] misc. code logic improvements

* [fantia] finish restructuring pagination code

* [fanbox] avoid making a request for each individual post when processing a creator page

* [fanbox] support embedded videos

* [fanbox] fix errors

* [fanbox] document extractor.fanbox.videos

* [fanbox] handle "article" and "entry" post types, all embeds

* [fanbox] fix downloading of embedded fanbox posts
2021-04-25 19:39:13 +02:00
Mike Fährmann
d900edfcfb [simplyhentai] fix extraction 2021-04-25 18:51:43 +02:00
Mike Fährmann
ba8180b5e6 [bcy] don't crash with deleted posts 2021-04-25 18:51:09 +02:00
Mike Fährmann
d108421461 [myportfolio] fix extraction 2021-04-24 01:22:57 +02:00