46 Commits

Author SHA1 Message Date
Mike Fährmann
53cdfaac37 [common] add reference to 'exception' module to Extractor class
- remove 'exception' imports
- replace with 'self.exc'
2026-02-15 10:57:22 +01:00
Mike Fährmann
40e4cc62c4 [common] pass job status to 'finalize()' 2026-02-07 21:47:17 +01:00
Mike Fährmann
00c6821a3f replace 2-element f-strings with simple '+' concatenations
Python's 'ast' module and its 'NodeVisitor' class
were incredibly helpful in identifying these
2025-12-22 11:26:04 +01:00
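Note on 00c6821a3f: the commit message credits 'ast' and 'NodeVisitor' with finding the affected f-strings, but does not show how. The following is a minimal sketch of one way such a search could work, not the script actually used. It assumes "2-element" means an f-string whose parsed 'JoinedStr' node has exactly two parts. The class name, function name, and sample source are hypothetical.

```python
import ast


class TwoPartFStringFinder(ast.NodeVisitor):
    """Collect line numbers of f-strings that consist of exactly two parts."""

    def __init__(self):
        self.hits = []

    def visit_JoinedStr(self, node):
        # An f-string is parsed into a JoinedStr whose 'values' list holds
        # Constant (literal text) and FormattedValue ({...}) nodes.
        if len(node.values) == 2:
            self.hits.append(node.lineno)
        # Keep walking so nested f-strings are inspected as well.
        self.generic_visit(node)


def find_two_part_fstrings(source):
    """Return the line numbers of all two-part f-strings in 'source'."""
    finder = TwoPartFStringFinder()
    finder.visit(ast.parse(source))
    return finder.hits


# Hypothetical sample input; only line 2 holds a two-part f-string.
SAMPLE = '''\
root = "https://example.org"
a = f"{root}/photos"
b = f"{root}/album/{root}"
c = f"plain"
'''

if __name__ == "__main__":
    print(find_two_part_fstrings(SAMPLE))
```

A hit such as f"{root}/photos" is a candidate for rewriting as root + "/photos", provided the formatted value is already a string.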
Mike Fährmann
e006d26c8e Revert "use f-strings when building 'pattern'"
revert d7c97d5a97.
2025-12-20 22:07:37 +01:00
Mike Fährmann
968597a302 yield 3-tuples for Message.Directory
adapt tuples to the same length and semantics as other messages
2025-12-05 21:39:52 +01:00
Mike Fährmann
d7c97d5a97 use f-strings when building 'pattern' 2025-10-20 21:23:11 +02:00
Mike Fährmann
9bf76c1352 replace 'util.re()' with 'text.re()'
remove unnecessary 'util' imports
2025-10-20 17:44:58 +02:00
Mike Fährmann
085616e0a8 [dt] replace 'text.parse_datetime()' & 'text.parse_timestamp()' 2025-10-17 17:43:06 +02:00
Mike Fährmann
752f82f002 [vk:album] extract more metadata (#8029)
… when possible / allowed by VK

- album[name]
- album[count]
- user[name]
- user[nick]
- user[group]
2025-09-05 19:33:57 +02:00
Mike Fährmann
821f5c9a9e [vk] parse 'description' metadata (#8029) 2025-09-05 19:27:25 +02:00
Mike Fährmann
89e1c785a6 [vk] parse 'date' metadata (#8029) 2025-09-05 18:04:34 +02:00
Vitaliy Levin
61d793dc7d [vk] add 'wall-post' extractor (#474 #6378 #8159)
* [vk] Added extractor for VK wall posts
* update
    - fix flake8
    - rename to 'wall-post'
    - remove __init__() / use self.groups
    - simplify 'description' extraction
* add test
* add to docs/supportedsites

---------

Co-authored-by: Mike Fährmann <mike_faehrmann@web.de>
2025-09-05 13:10:08 +02:00
Mike Fährmann
d54c35a0d3 [vkj] update default 'archive_fmt' (#8030)
include 'user[id]' to avoid duplicate IDs
2025-08-14 08:07:29 +02:00
Mike Fährmann
a097a373a9 simplify if statements by using walrus operators (#7671) 2025-07-22 20:57:54 +02:00
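Note on a097a373a9: a small before/after sketch of the kind of simplification the message describes, assuming the usual "assign, then test" pattern. The function names and the regular expression are hypothetical and not taken from the extractor.

```python
import re


def album_id_plain(url):
    # Before: assignment and test on separate lines.
    match = re.search(r"/album(\d+)", url)
    if match:
        return match[1]
    return None


def album_id_walrus(url):
    # After: the walrus operator (Python 3.8+) binds and tests in one step.
    if match := re.search(r"/album(\d+)", url):
        return match[1]
    return None
```

Both functions return the same result for any input; only the shape of the condition changes.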
Mike Fährmann
d8ef1d693f rename 'StopExtraction' to 'AbortExtraction'
for cases where StopExtraction was used to report errors
2025-07-09 21:07:28 +02:00
Mike Fährmann
9dbe33b6de replace old %-formatted and .format(…) strings with f-strings (#7671)
mostly using flynt
https://github.com/ikamensh/flynt
2025-06-29 17:50:19 +02:00
Mike Fährmann
41191bb60a 'match.group(N)' -> 'match[N]' (#7671)
2.5x faster
2025-06-18 13:05:58 +02:00
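Note on 41191bb60a: the two spellings are interchangeable because 're.Match' objects support indexing since Python 3.6. The sketch below only demonstrates that equivalence, using a hypothetical pattern and URL. It does not reproduce the "2.5x faster" figure, which is the commit author's measurement and depends on interpreter version and workload.

```python
import re

# Hypothetical pattern and input, for illustration only.
PATTERN = re.compile(r"/album(-?\d+)_(\d+)")
URL = "https://example.org/album-12_345"


def groups_by_method(match):
    # Old style: explicit method call per group.
    return match.group(1), match.group(2)


def groups_by_index(match):
    # New style: subscript access, equivalent to match.group(N).
    return match[1], match[2]


if __name__ == "__main__":
    m = PATTERN.search(URL)
    print(groups_by_method(m) == groups_by_index(m))
```

To compare the speed locally, each function can be timed with the standard 'timeit' module on the same match object.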
Mike Fährmann
e08ec7e083 update copyright notices 2025-06-13 00:03:41 +02:00
Mike Fährmann
0df083b208 [vk] prevent '404 Not Found' errors for file downloads
only strip query parameters when regex substitution applies
2025-06-11 22:32:08 +02:00
Mike Fährmann
80599fa610 [vk] fix 'user' metadata extraction
add boolean 'group' field
2025-06-11 20:01:27 +02:00
Mike Fährmann
85931185a6 [vk] add continuation message (#7650) 2025-06-11 18:07:39 +02:00
Mike Fährmann
8287a1b372 [vk] detect redirects to 'challenge' pages (#7650) 2025-06-11 18:02:14 +02:00
Mike Fährmann
b5c88b3d3e replace standard library 're' uses with 'util.re()' 2025-06-06 13:24:52 +02:00
Mike Fährmann
0e4e40c9d2 [vk] document 'offset', add '--range' support 2024-10-17 21:20:21 +02:00
Sergey Kolesnik
67be377c77 Update vk.py: add "offset" option 2024-10-16 02:07:58 +03:00
Mike Fährmann
db8de13537 [vk] transform image URLs to non-blurred versions (#5017)
apply the same filter from before d85e66bc
2024-01-07 00:19:10 +01:00
Mike Fährmann
1f9b16a70b replace static 'sleep-request' defaults with dynamic ones 2023-12-18 22:06:26 +01:00
Mike Fährmann
a453335a9f remove test results in extractor modules
and add generic example URLs
2023-09-11 16:30:55 +02:00
Mike Fährmann
ed2d715019 fix 'keywords' in extractor tests (#3491) 2023-01-03 15:14:23 +01:00
Mike Fährmann
560f7b41d8 [vk] add 'tagged' extractor (#2997) 2022-10-08 15:41:39 +02:00
Mike Fährmann
122e1a467a [vk] unescape error messages 2022-10-08 15:41:39 +02:00
Mike Fährmann
e660e48a60 [vk] prevent exceptions for broken/invalid photos (#2774) 2022-07-27 18:52:43 +02:00
Mike Fährmann
367a491128 [vk] get URLs from *_src entries (#2535)
https://github.com/mikf/gallery-dl/issues/2535#issuecomment-1166566986
2022-06-27 10:24:35 +02:00
Mike Fährmann
435e9c5d2e [vk] report errors for private albums (#2556) 2022-05-04 17:34:50 +02:00
Mike Fährmann
3e6aba05ab [vk] add fallback for user ID extraction (#2535) 2022-05-03 13:42:45 +02:00
Mike Fährmann
5b7423d14c [vk] fix URLs for older photos (#2535) 2022-05-02 11:19:18 +02:00
Mike Fährmann
a39e7b7366 [vk] handle photos without width/height info (fixes #2535) 2022-05-02 09:03:00 +02:00
Mike Fährmann
d85e66bcac [vk] fix extraction (#2512)
Use a different API endpoint, since thumbnail URLs from the old one
cannot be transformed into URLs for "original" photos anymore.
2022-04-21 14:01:50 +02:00
Mike Fährmann
1f9a0e2fd8 update extractor test results 2022-04-18 17:24:00 +02:00
Mike Fährmann
655b2de5d9 [vk] fix infinite pagination loops (fixes #2297) 2022-02-13 23:00:50 +01:00
Mike Fährmann
dee0d22561 update extractor test results 2022-02-06 21:39:24 +01:00
Mike Fährmann
c19e762fdf [vk] add 'album' extractor (#474, fixes #1952)
todo: better metadata for albums
2021-10-23 00:46:20 +02:00
Mike Fährmann
fe970fc87f [vk] prevent exception for empty/private profiles (fixes 1742) 2021-08-02 15:38:51 +02:00
Mike Fährmann
36a2aff363 [vk] improve metadata extraction and URL pattern (fixes #1691)
- always fetch all user metadata
- use 'user[name]' for directory names if available
2021-07-15 00:43:42 +02:00
Mike Fährmann
ec3d5d58a8 [vk] improve extractor (#474)
- fetch all photos
- add 'metadata' option
- fix extracting photos without '?' in URL
2021-04-01 14:35:56 +02:00
Mike Fährmann
62cfee4d28 [vk] initial support for albums (#474) 2021-03-23 19:02:16 +01:00