gallery-dl

Author	SHA1	Message	Date
Mike Fährmann	9f3368c46f	[pornhub] fix 'user' metadata for gifs	2023-11-26 23:52:24 +01:00
Mike Fährmann	a453335a9f	remove test results in extractor modules and add generic example URLs	2023-09-11 16:30:55 +02:00
Mike Fährmann	a783c4f0fe	[pornhub] add 'gif' support (#4463)	2023-08-29 19:34:27 +02:00
Mike Fährmann	088e8d5fcf	[pornhub] fix extraction (#4301)	2023-07-22 14:05:40 +02:00
Mike Fährmann	d97b8c2fba	consistent cookie-related names - rename every cookie variable or method to 'cookies_*' - simplify '.session.cookies' to just '.cookies' - more consistent 'login()' structure	2023-07-22 01:20:50 +02:00
Mike Fährmann	20da41018d	[pornhub] set 'accessAgeDisclaimerPH' cookie (#4301)	2023-07-14 14:30:27 +02:00
Mike Fährmann	6c8bf9a762	[pornhub] improve redirect handling (#4188)	2023-06-15 16:32:53 +02:00
Vrihub	96fcff182c	generic extractor (#735) * Generic extractor, see issue #683 * Fix failed test_names test, no subcategory needed * Prefix directory_fmt with "generic" * Relax regex (would break some urls) * Flake8 compliance * pattern: don't require a scheme This fixes a bug when we force the generic extractor on urls without a scheme (that are allowed by all other extractors). * Fix using g: and r: on urls without http(s) scheme Almost all extractors accept urls without an initial http(s) scheme. Many extractors also allow for generic subdomains in their "pattern" variable; some of them implement this with the regex character class "[^.]+" (everything but a dot). This leads to a problem when the extractor is given a url starting with g: or r: (to force using the generic or recursive extractor) and without the http(s) scheme: e.g. with "r:foobar.tumblr.com" the "r:" is wrongly considered part of the subdomain. This commit fixes the bug, replacing the too generic "[^.]+" with the more specific "[\w-]+" (letters, digits and "-", the only characters allowed in domain names), which is already used by some extractors. * Relax imageurl_pattern_ext: allow relative urls * First round of small suggested changes * Support image urls starting with "//" * self.baseurl: remove trailing slash * Relax regexp (didn't catch some image urls) * Some fixes and cleanup * Fix domain pattern; option to enable extractor Fixed the domain section for "pattern", to pass "test_add" and "test_add_module" tests. Added the "enabled" configuration option (default False) to enable the generic extractor. Using "g(eneric):URL" forces using the extractor.	2021-12-29 22:39:29 +01:00
Mike Fährmann	47a780942c	update extractor test results	2021-09-03 19:36:12 +02:00
Mike Fährmann	bd08ee2859	remove most 'yield Message.Version' statements only leave them in oauth.py as noop results	2021-08-16 03:10:48 +02:00
Mike Fährmann	ae6748996a	[pornhub] update tests	2020-12-21 02:06:28 +01:00
Mike Fährmann	968d3e8465	remove '&' from URL patterns '/?&#' -> '/?#' and '?&#' -> '?#' According to https://www.ietf.org/rfc/rfc3986.txt, URLs are "organized hierarchically" by using "the slash ("/"), question mark ("?"), and number sign ("#") characters to delimit components"	2020-10-22 23:31:25 +02:00
Mike Fährmann	844793847c	update extractor test results	2020-10-11 18:15:41 +02:00
Mike Fährmann	c6c5cb1898	improve 'deviantart.quality' description	2019-08-30 18:41:18 +02:00
Mike Fährmann	c73c2cda50	[pornhub] add gallery & user extractor (#282)	2019-06-07 16:31:20 +02:00

15 Commits
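
The subdomain fix described in commit 96fcff182c can be illustrated with a small sketch. The two patterns below are hypothetical stand-ins written for this example, not copied from any gallery-dl extractor; they only contrast the character class "[^.]+" with "[\w-]+" on a URL that carries the "r:" prefix and no http(s) scheme.

```python
import re

# Hypothetical patterns for illustration only; the real extractor
# patterns in gallery-dl are more involved.
LOOSE = re.compile(r"(?:https?://)?([^.]+)\.tumblr\.com")    # everything but a dot
STRICT = re.compile(r"(?:https?://)?([\w-]+)\.tumblr\.com")  # letters, digits, "_", "-"


def subdomain(pattern, url):
    """Return the captured subdomain, or None if the pattern does not match."""
    match = pattern.match(url)
    return match.group(1) if match else None


# The loose class swallows the "r:" prefix as part of the subdomain.
print(subdomain(LOOSE, "r:foobar.tumblr.com"))
# The strict class rejects the prefixed URL, so the prefix can be handled elsewhere.
print(subdomain(STRICT, "r:foobar.tumblr.com"))
# URLs without a prefix still match, with or without a scheme.
print(subdomain(STRICT, "foobar.tumblr.com"))
```

Note that Python's "\w" also matches the underscore, so "[\w-]+" is slightly broader than the "letters, digits and '-'" wording in the commit message.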