Mike Fährmann
41191bb60a
'match.group(N)' -> 'match[N]' (#7671)
...
2.5x faster
2025-06-18 13:05:58 +02:00
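A minimal sketch of the equivalence behind this change: since Python 3.6, indexing a match object goes through `Match.__getitem__` and returns the same value as `Match.group()`. The pattern and input string are hypothetical, and the timing claim from the commit is not reproduced here.

```python
import re

# Hypothetical pattern and input, chosen only for illustration.
pattern = re.compile(r"(\w+)-(\d+)")
match = pattern.match("post-12345")

# Index access and .group() return identical values for every group.
whole_old, whole_new = match.group(0), match[0]
name_old, name_new = match.group(1), match[1]
num_old, num_new = match.group(2), match[2]

print(whole_new, name_new, num_new)
```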
pocketinternet
3ea244eebb
[webtoons] add 'thumbnails' option (#6468 #7441)
...
* Update webtoons.py
Added thumbnail download capability which defaults to false
* Update configuration.rst
Added documentation for webtoon thumbnail option
* extract thumbnails in GalleryExtractor.assets()
* simplify & fix flake8
* include 'type' in default filenames
* add test
* update docs
---------
Co-authored-by: Mike Fährmann <mike_faehrmann@web.de>
2025-06-17 19:34:58 +02:00
Mike Fährmann
d0b1da0fa7
[tiktok] handle exceptions when extracting avatars (#7682)
2025-06-17 12:21:20 +02:00
Mike Fährmann
efa2ab5903
[common] allow GalleryExtractors to return additional asset files
...
like thumbnails or covers
2025-06-16 22:45:52 +02:00
SpiffyChatterbox
e0f65be36b
[nudostar] add support (#5735 #6556)
...
* Drafting initial basic extractor layout
* Better debug logging
* Update nudostar.py
Still tinkering
* Update nudostar.py
Basic extractor is working. Now starting on Gallery
* Update nudostar.py
Still a work in progress.
Got individual posts working, galleries are not.
* Update nudostar.py
* Site now appears working. Added Tests.
* PEP Updates
* PEP - Line Length Updates
* Update nudostar.py
Resolving PEP8 issues.
* update 'gallery' extractor, rename to 'model'
* update 'image' extractor
* expand tests
* update docs/supportedsites
---------
Co-authored-by: Mike Fährmann <mike_faehrmann@web.de>
2025-06-16 19:21:49 +02:00
missionfloyd
13cb031fe2
[girlsreleased] add support (#6200)
...
* [girlsreleased] add support
* Lint
* Change "galleries" to "sets"
As it is on the site
* Add pagination
Add tests
* Fix tests
* Remove leftover print()
* Don't remove first set
* Yield pages
* Add filename metadata
* [girlsreleased] Refactor
* Return models as array
* Add filename numbering
Add date metadata
* Add URL metadata
* Spawn set extractor the right way
* Adjust model/site regex
* update
- restructure some code
- remove constructors
- use f-strings
* expand tests
* update docs/supportedsites
2025-06-16 19:18:19 +02:00
SpiffyChatterbox
48ac41605d
[redbust] add support (#6759 #6918 #7043)
...
* init - Redbust.com Support
* Added Test
Could use a second set of eyes on this
* update 'gallery' extractor
- extract more metadata
- simplify image extraction
- support legacy galleries
* add tests
* update 'image' extractor
* add 'tag' extractor
* add 'archive' extractor
* restrict 'image' extractor pattern
* update docs/supportedsites
* replace quotes inside f-string
---------
Co-authored-by: Mike Fährmann <mike_faehrmann@web.de>
2025-06-16 12:10:42 +02:00
Mike Fährmann
d9432ee297
[warosu] restore correct 'now' values
2025-06-16 12:00:59 +02:00
NecRaul
3bc6bc7c77
[warosu] Single quotes when string has a quotation mark
2025-06-16 12:50:20 +04:00
NecRaul
f56e810f42
[warosu] Attribute fix
2025-06-16 12:33:16 +04:00
Mike Fährmann
0a3fac2dfe
merge #7664: [archivedmoe] redirect URL fixes (#7652)
2025-06-15 10:03:34 +02:00
Mike Fährmann
b245218c1d
[archivedmoe] reword some comments and variable names
2025-06-15 10:00:45 +02:00
hunter-gatherer8
96f5cfb305
[girlswithmuscle] add support (#4493 #6016)
...
* [girlswithmuscle] init
* [girlswithmuscle]: fix metadata extraction (site layout change)
* [girlswithmuscle]: fix tags extraction (site layout change)
* update login code
* update 'post' extractor
* update 'gallery' extractor, rename to 'search' extractor
* update docs
* add test cases
---------
Co-authored-by: Mike Fährmann <mike_faehrmann@web.de>
2025-06-14 23:05:49 +02:00
Mike Fährmann
b583891df6
[pixiv] remove '/jump.php' from *all* AJAX caption links (#4327)
...
https://github.com/mikf/gallery-dl/issues/4327#issuecomment-2969765775
2025-06-13 21:10:51 +02:00
NecRaul
6668acf91e
[archivedmoe] Sort boards alphabetically
2025-06-13 19:29:47 +04:00
NecRaul
3a4e19d284
[archivedmoe] Simplify board extraction from url
2025-06-13 18:44:02 +04:00
NecRaul
a7aa18a8c1
[archivedmoe] remove unnecessary logging
2025-06-13 18:28:21 +04:00
NecRaul
8b2adeb41e
[archivedmoe] simplify board URL redirection logic
2025-06-13 18:26:39 +04:00
NecRaul
05081dea2e
Lint with flake8
2025-06-13 17:56:43 +04:00
NecRaul
223fe960a0
[archivedmoe] redirect URL changes (again)
...
Redirects to warosu.org instead of 4chan's cdn for certain boards
Redirects to archive.4plebs.org instead of 4chan's cdn for /tg/
Slices the filename only if it's redirecting to certain archives
2025-06-13 17:43:16 +04:00
Mike Fährmann
9d3cf67f3e
[pinterest] remove excess whitespace from 'description' fields (#4335)
...
and 'closeup_unified_description' & 'closeup_description'
2025-06-13 13:11:18 +02:00
Mike Fährmann
e08ec7e083
update copyright notices
2025-06-13 00:03:41 +02:00
Mike Fährmann
811b665e33
remove @staticmethod decorators
...
There might have been a time when calling a static method was faster
than a regular method, but that is no longer the case. According to
micro-benchmarks, it is 70% slower in CPython 3.13 and it also makes
executing the code of a class definition slower.
2025-06-12 22:50:52 +02:00
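A rough way to repeat this kind of micro-benchmark, assuming `timeit` from the standard library and two hypothetical classes that differ only in the decorator. Results depend on the interpreter version and machine, so no figures are claimed here.

```python
import timeit


class WithStatic:
    @staticmethod
    def double(x):
        return x * 2


class WithoutStatic:
    def double(self, x):
        return x * 2


a = WithStatic()
b = WithoutStatic()

# Both variants compute the same result; only the call path differs.
result_static = a.double(21)
result_regular = b.double(21)

# Time repeated calls through an instance for each variant.
t_static = timeit.timeit(lambda: a.double(21), number=100_000)
t_regular = timeit.timeit(lambda: b.double(21), number=100_000)

print(f"staticmethod: {t_static:.4f}s  regular: {t_regular:.4f}s")
```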
Mike Fährmann
e41c317039
[fantia] prevent '.html' file downloads
2025-06-12 20:41:47 +02:00
Mike Fährmann
e2d104a110
[twitter] extract 'source_id' and 'source_user' metadata (#7470 #7640)
2025-06-12 18:59:22 +02:00
Mike Fährmann
06e2f2cd91
[twitter] restructure media data extraction
2025-06-12 18:53:15 +02:00
Mike Fährmann
56ea27c474
[blogger] move original/s0 URL code into a separate function
2025-06-12 17:07:56 +02:00
Mike Fährmann
16fc5e0d68
[batoto] fix downloading manga with alerts/notices (#7657)
...
and improve alert message extraction
2025-06-12 08:26:26 +02:00
Mike Fährmann
a14671992c
[sexcom] prevent '.css' file downloads (#7632)
...
by detecting homepage redirects
and improve redirect handling in general
2025-06-11 22:32:08 +02:00
Mike Fährmann
0df083b208
[vk] prevent '404 Not Found' errors for file downloads
...
only strip query parameters when regex substitution applies
2025-06-11 22:32:08 +02:00
Mike Fährmann
d065452ba3
merge #7653: [archivedmoe] fix redirection issue (#7652)
2025-06-11 20:04:42 +02:00
Mike Fährmann
80599fa610
[vk] fix 'user' metadata extraction
...
add boolean 'group' field
2025-06-11 20:01:27 +02:00
NecRaul
e3df99dbb9
Apply mikf's diff regarding Archived.moe
...
Moved (and refactored) code into remote()
Added a check for fixup_timestamp
2025-06-11 21:51:03 +04:00
Mike Fährmann
85931185a6
[vk] add continuation message (#7650)
2025-06-11 18:07:39 +02:00
Mike Fährmann
8287a1b372
[vk] detect redirects to 'challenge' pages (#7650)
2025-06-11 18:02:14 +02:00
NecRaul
4370654532
Simplify remote_media_link assignment
2025-06-11 04:49:21 +04:00
NecRaul
cb74d0f2f3
Lint with flake8
2025-06-11 04:46:18 +04:00
NecRaul
96bb2b1630
Fix Archived.moe redirection issue
...
Unless the board is /b/ (in which case redirection works fine),
trim the filename portion of the URL until it is
13 characters long (epoch millis).
2025-06-11 04:42:03 +04:00
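The trimming rule described in this commit could look roughly like the sketch below. It is not the extractor's actual code: the function name, the example host, and the choice to drop characters from the end of the filename stem (keeping the extension) are all assumptions made for illustration.

```python
import posixpath
from urllib.parse import urlsplit, urlunsplit


def trim_filename(url, board, length=13):
    """Shorten the filename stem in 'url' to 'length' characters.

    Hypothetical helper; /b/ is left untouched, as the commit describes.
    """
    if board == "b":
        return url

    parts = urlsplit(url)
    directory, filename = posixpath.split(parts.path)
    stem, ext = posixpath.splitext(filename)

    # Assumption: surplus characters are removed from the end of the stem,
    # leaving a 13-digit millisecond timestamp.
    stem = stem[:length]

    new_path = posixpath.join(directory, stem + ext)
    return urlunsplit(parts._replace(path=new_path))


# Hypothetical 16-character filename on a non-/b/ board.
millis = "1718" + "0" * 9
example = trim_filename(f"https://example.org/g/{millis}123.jpg", "g")
print(example)
```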
Mike Fährmann
b4aed5e2c9
[common] allow overriding 'user-agent' when 'browser' is used (#7647)
2025-06-10 22:05:28 +02:00
Mike Fährmann
8e698d1a64
[ytdl] set domain as subcategory when using Generic extractor (#6582)
...
https://github.com/mikf/gallery-dl/issues/6582#issuecomment-2959879730
2025-06-10 21:35:15 +02:00
Mike Fährmann
4cfddc144a
[common] import 'datetime' class directly
2025-06-10 21:35:15 +02:00
Mike Fährmann
e68555defa
[common] improve cookie-related logging messages
2025-06-10 21:34:27 +02:00
Mike Fährmann
511cf2363c
[common] update expired cookie messages (#7644)
...
- prefix with 'cookies:'
- include domain
- include exact time when it expired
2025-06-09 18:48:04 +02:00
Mike Fährmann
5f41ac4257
[4archive] fix 'thread' extractor
2025-06-08 21:52:54 +02:00
Mike Fährmann
827eeca0bc
[paheal] fix '404 Not Found' for tags with URL encoded characters (#7642)
2025-06-08 16:23:11 +02:00
Mike Fährmann
17d39c06e3
[exhentai] implement '"source": "metadata"' (#4902)
2025-06-08 12:57:23 +02:00
Mike Fährmann
967af5eede
[exhentai] add 'limits-action' option (#6504)
...
https://github.com/mikf/gallery-dl/issues/6504#issuecomment-2949551532
2025-06-08 12:56:56 +02:00
Mike Fährmann
3b75b195c1
[exhentai] detect HTML downloads (#4798)
2025-06-07 22:06:53 +02:00
Mike Fährmann
27c48ad317
[exhentai] ensure file signature bytes aren't all zero (#4902)
2025-06-07 20:34:05 +02:00
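A check of this kind might be written as in the following sketch. The function name and the 16-byte window are hypothetical; the commit does not state how many leading bytes are inspected.

```python
def signature_is_blank(data, length=16):
    """Return True if the first 'length' bytes exist and are all zero.

    Hypothetical helper; the window size is an assumption.
    """
    head = data[:length]
    # Iterating over bytes yields integers, so any() is False
    # only when every inspected byte is 0x00.
    return len(head) > 0 and not any(head)


zeroed = signature_is_blank(b"\x00" * 32)
jpeg_like = signature_is_blank(b"\xff\xd8\xff\xe0" + b"\x00" * 12)
print(zeroed, jpeg_like)
```

Empty input is reported as not blank here, on the assumption that a missing download is handled elsewhere.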
Mike Fährmann
8227e21257
[deviantart:tiptap] fix TypeError when 'textAlign' is null (#7639)
2025-06-07 19:06:43 +02:00