Commit Graph

39 Commits

Author SHA1 Message Date
Mike Fährmann
7260e236d8 [furaffinity] fix 'favorite' pagination (#6151) 2024-09-06 08:52:05 +02:00
Mike Fährmann
c07a074418 [furaffinity] remove end of system messages 2024-08-07 18:39:34 +02:00
Mike Fährmann
846512f6cd [furaffinity] add 'submissions' extractor (#5954) 2024-08-07 16:34:32 +02:00
Mike Fährmann
9a8ec95398 [furaffinity] improve 2024-07-18 01:39:58 +02:00
Nicholas Bishop
4f3be2f571 Remove leftover split() statement
Forgot to remove this from a previous implementation attempt
2024-07-04 16:50:49 -04:00
Nicholas Bishop
f43bccb5be [furaffinity] Add 'thumbnail' (#1284) and 'folders' properties
Retrieve 'thumbnail' and 'folders' properties for each post.
'thumbnail' (#1284):
 - Preview image used for search results, writing posts, music, etc.
 - Filename format: <post_id>@600-<directory_containing_full_image>.jpg
'Folders' (related to #1817):
 - A list of all gallery folders containing this post
 - Folder name format: [<folder_category> - ]<folder_name>
 - Only works on new layout; old layout does not show folders, so list will be empty

A test is included for each property.
2024-07-04 15:41:14 -04:00
Delphox
11109d5bad [furaffinity] match xfuraffinity.com 2024-05-08 12:15:47 -03:00
Mike Fährmann
5aefa9465c [furaffinity] simplify pattern, match original domain first 2024-04-25 21:58:34 +02:00
Delphox
6969963125 [furaffinity] support fxfuraffinity and fxraffinity urls 2024-04-25 13:25:39 -03:00
Mike Fährmann
2cd801232b fix --range causing crashes (#4557)
regression caused by a383eca7
2023-09-22 16:28:20 +02:00
Mike Fährmann
a453335a9f remove test results in extractor modules
and add generic example URLs
2023-09-11 16:30:55 +02:00
Mike Fährmann
a383eca7f6 decouple extractor initialization
Introduce an 'initialize()' function that does the actual init
(session, cookies, config options) and can called separately from
the constructor __init__().

This allows, for example, to adjust config access inside a Job
before most of it already happened when calling 'extractor.find()'.
2023-07-25 22:16:16 +02:00
Mike Fährmann
d97b8c2fba consistent cookie-related names
- rename every cookie variable or method to 'cookies_*'
- simplify '.session.cookies' to just '.cookies'
- more consistent 'login()' structure
2023-07-22 01:20:50 +02:00
Mike Fährmann
b6c959744d [furaffinity] improve 'description' HTML (#4224)
- ignore header
- include footer and closing <div> if present
2023-06-24 20:30:40 +02:00
Mike Fährmann
5e3a1749c8 [furaffinity] simplify 'favorite_id' assignment 2023-06-09 16:35:59 +02:00
Bad Manners
952c03bc9e Add fav_id data to FuraffinityFavoriteExtractor
An extra field is collected when paginating favorites, and saved to
a temporary cache variable. This field is identical for both the old
and the new page layouts for FurAffinity, but can only be collected
during pagination, hence the cache variable. Other FurAffinity
extractors should be unaffected by this change.
2023-06-02 19:53:47 -03:00
Mike Fährmann
b0cb4a1b9c replace 'text.extract()' with 'text.extr()' where possible 2022-11-05 01:14:09 +01:00
Mike Fährmann
c4cc387f7d [furaffinity] fix search result pagination (fixes #2402) 2022-03-18 13:44:36 +01:00
Mike Fährmann
f1c853c6ef [furaffinity] add 'layout' option (#2277)
to be able to force gallery-dl to parse according to a specific layout
in case its auto-detect fails
2022-02-11 00:28:47 +01:00
Mike Fährmann
4efe56f419 [furaffinity] improve new/old layout detection (fixes #2277) 2022-02-08 18:10:52 +01:00
Mike Fährmann
87ce3fa669 [furaffinity] warn when no session cookies were found 2021-12-15 16:21:05 +01:00
Mike Fährmann
975e0a4fe0 [furaffinity] unquote search queries (#1958)
instead of unescape
(unquote -> url params, unescape -> html entities)
2021-10-17 20:48:08 +02:00
Mike Fährmann
6b229ac829 [furaffinity] expand URL pattern for searches (closes #1780) 2021-08-23 23:38:52 +02:00
Mike Fährmann
a5de2244d4 [furaffinity] fix using 'category-tranfer' (#1274) 2021-08-08 23:50:32 +02:00
Mike Fährmann
5b1c62bfa9 [furaffinity] add 'external' option (closes #1492) 2021-08-08 19:16:02 +02:00
Mike Fährmann
4adc44df69 [furaffinity] improve metadata extraction (fixes #1630)
Fetch 'title' and 'artist' metadata from a different location,
since for posts with an empty title the <title> element is
completely empty and does not contain the artist's name.
2021-06-19 03:29:00 +02:00
Mike Fährmann
2919d78bfc update extractor test results 2021-02-14 15:37:39 +01:00
Mike Fährmann
89a2bcbb2d [furaffinity] add 'descriptions' option (#1231) 2021-01-19 19:09:29 +01:00
Mike Fährmann
968d3e8465 remove '&' from URL patterns
'/?&#' -> '/?#' and '?&#' -> '?#'

According to https://www.ietf.org/rfc/rfc3986.txt, URLs are
"organized hierarchically" by using "the slash ("/"), question
mark ("?"), and number sign ("#") characters to delimit components"
2020-10-22 23:31:25 +02:00
Mike Fährmann
844502cad5 update extractor test results 2020-10-03 19:24:19 +02:00
Mike Fährmann
dcf3ad7eef [furaffinity] update download URL extraction (fixes #988)
support the new 'd2.facdn.net' subdomain
2020-09-11 13:23:57 +02:00
Mike Fährmann
063c71cd84 [furaffinity] add 'search' extractor (closes #915) 2020-08-18 21:26:46 +02:00
Mike Fährmann
ffb6c5277a [furaffinity] add 'artist_url' metadata field (closes #821) 2020-06-11 18:36:24 +02:00
Mike Fährmann
fa2952ac55 [furaffinity] add 'following' extractor (#515) 2020-04-17 22:18:39 +02:00
Mike Fährmann
d3482ace7f [furaffinity] extract more metadata
- views
- favorites
- comments
- rating
- fa_category (since 'category' is already in use)
- theme
- species
- gender
- width
- height
2020-03-13 23:56:55 +01:00
Mike Fährmann
4e361b3008 add tests for specific datetime values 2020-02-23 16:48:30 +01:00
Mike Fährmann
650f2b6d58 [furaffinity] accept sfw.furaffinity.net URLs (closes #608)
Just as an alias for regular URLs with no extra content filtering.
2020-02-15 22:47:12 +01:00
Mike Fährmann
c7cf9dd111 [furaffinity] support classic layout (#284) 2020-02-12 21:39:43 +01:00
Mike Fährmann
138135c190 [furaffinity] add extractors (#284) 2020-02-11 19:51:24 +01:00