Mike Fährmann
|
53cdfaac37
|
[common] add reference to 'exception' module to Extractor class
- remove 'exception' imports
- replace with 'self.exc'
|
2026-02-15 10:57:22 +01:00 |
|
Mike Fährmann
|
00c6821a3f
|
replace 2-element f-strings with simple '+' concatenations
Python's 'ast' module and its 'NodeVisitor' class
were incredibly helpful in identifying these
|
2025-12-22 11:26:04 +01:00 |
|
Mike Fährmann
|
e006d26c8e
|
Revert "use f-strings when building 'pattern'"
revert d7c97d5a97.
|
2025-12-20 22:07:37 +01:00 |
|
Mike Fährmann
|
968597a302
|
yield 3-tuples for Message.Directory
adapt tuples to the same length and semantics as other messages
|
2025-12-05 21:39:52 +01:00 |
|
Mike Fährmann
|
d7c97d5a97
|
use f-strings when building 'pattern'
|
2025-10-20 21:23:11 +02:00 |
|
Mike Fährmann
|
9bf76c1352
|
replace 'util.re()' with 'text.re()'
remove unnecessary 'util' imports
|
2025-10-20 17:44:58 +02:00 |
|
Mike Fährmann
|
c38856bd3f
|
[dt] use 'parse_datetime_iso()' for ISO formats
|
2025-10-19 21:52:05 +02:00 |
|
Mike Fährmann
|
085616e0a8
|
[dt] replace 'text.parse_datetime()' & 'text.parse_timestamp()'
|
2025-10-17 17:43:06 +02:00 |
|
Mike Fährmann
|
a097a373a9
|
simplify if statements by using walrus operators (#7671)
|
2025-07-22 20:57:54 +02:00 |
|
Mike Fährmann
|
d8ef1d693f
|
rename 'StopExtraction' to 'AbortExtraction'
for cases where StopExtraction was used to report errors
|
2025-07-09 21:07:28 +02:00 |
|
Mike Fährmann
|
9dbe33b6de
|
replace old %-formatted and .format(…) strings with f-strings (#7671)
mostly using flynt
https://github.com/ikamensh/flynt
|
2025-06-29 17:50:19 +02:00 |
|
Mike Fährmann
|
4f7a0e5d02
|
[arcalive] fix download URLs (#7678)
https://github.com/danbooru/danbooru/pull/6041
|
2025-06-18 21:12:18 +02:00 |
|
Mike Fährmann
|
b5c88b3d3e
|
replace standard library 're' uses with 'util.re()'
|
2025-06-06 13:24:52 +02:00 |
|
Mike Fährmann
|
5e07d30d91
|
[arcalive] replace 'ac-p.namu' subdomains with 'ac-o.namu' (#7556)
to prevent "403 Forbidden" errors when downloading
|
2025-05-24 22:12:15 +02:00 |
|
Mike Fährmann
|
83054325e0
|
[arcalive] allow overriding default User-Agent header (#7556)
|
2025-05-23 08:05:47 +02:00 |
|
Mike Fährmann
|
6894e0bc70
|
[arcalive] extend 'gifs' option
use fallback URLs by default
|
2025-03-26 20:59:18 +01:00 |
|
Mike Fährmann
|
dbe8820b9e
|
[arcalive] add 'gifs' option (#5657)
|
2025-03-14 22:34:45 +01:00 |
|
Mike Fährmann
|
31e57bafab
|
[arcalive] add 'user' extractor (#5657)
|
2025-03-14 18:58:10 +01:00 |
|
hdk5
|
d900e868e4
|
[arcalive] add support (#5657 #7100)
* [arca.live] Add extractor skeleton
* [arcalive] update names and formatting
* [arcalive] implement initial file extraction code
* [arcalive] improve '_extract_media()' performance
compile and cache regex on demand
* [arcalive] improve image extraction
- extract 'data-originalurl' URLs if available
- replace URL query strings with 'type=orig'
- ignore emoticons by default
* [arcalive] update defaults
- include 'title' in filenames
- use 0.5-1.5s delay between requests
* [arcalive] use ext from 'data-orig' if available
* [arcalive] update docs/supportedsites
* [arcalive] add tests
* [arcalive] update 'board' extractor pattern
so it doesn't also match 'post' URLs
---------
Co-authored-by: Mike Fährmann <mike_faehrmann@web.de>
|
2025-03-14 10:52:21 +01:00 |
|
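Commit 00c6821a3f above credits Python's 'ast' module and its 'NodeVisitor' class with identifying 2-element f-strings. The log does not include the script that was used, so the following is only a minimal sketch of how such a search might look. The names 'TwoElementFStringFinder' and 'find_two_element_fstrings' are hypothetical, and the rule that only parts without a conversion or format spec count as candidates is an assumption, not the author's stated criterion.

```python
import ast


class TwoElementFStringFinder(ast.NodeVisitor):
    """Collect line numbers of f-strings made of exactly two plain parts."""

    def __init__(self):
        self.lines = []

    def visit_JoinedStr(self, node):
        # An f-string is parsed into a JoinedStr whose 'values' are
        # Constant (literal text) and FormattedValue ({...}) nodes.
        if len(node.values) == 2 and all(map(self._is_plain, node.values)):
            self.lines.append(node.lineno)
        self.generic_visit(node)

    def visit_FormattedValue(self, node):
        # Descend into the expression only. A format spec is itself a
        # JoinedStr and would otherwise be reported as an f-string.
        self.visit(node.value)

    @staticmethod
    def _is_plain(part):
        # Assumption: a part is replaceable by '+' only if it is literal
        # text or a {...} field without '!r'-style conversion or ':spec'.
        if isinstance(part, ast.Constant):
            return True
        return part.conversion == -1 and part.format_spec is None


def find_two_element_fstrings(source):
    """Return the line numbers of candidate f-strings in 'source'."""
    finder = TwoElementFStringFinder()
    finder.visit(ast.parse(source))
    return finder.lines
```

Calling 'find_two_element_fstrings()' on a module's source text returns the line numbers worth reviewing by hand. Whether a '+' concatenation is actually safe still depends on the operand being a string, which this sketch does not check.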