10 Commits

Author SHA1 Message Date
Mike Fährmann
2be54be692 [subscribestar] merge 'user-tag' into regular 'user' extractor (#8737) 2025-12-23 18:58:25 +01:00
Mike Fährmann
609e19273d [subscribestar] add 'user-tag' extractor (#8737) 2025-12-21 22:14:17 +01:00
Mike Fährmann
c8fc790028 merge branch 'dt': move datetime utils into separate module
- use 'datetime.fromisoformat()' when possible (#7671)
- return a datetime-compatible object for invalid datetimes
  (instead of a 'str' value)
2025-10-20 09:30:05 +02:00
Mike Fährmann
d57dc48dcd [tests/results] replace 'datetime' usage
test for 'dt:…' to catch NullDatetime instances,
which inherit from 'datetime.datetime' but are not exactly this class
2025-10-17 17:06:39 +02:00
Mike Fährmann
36a3fe45e4 [subscribestar] improve 'filename' (#8416) 2025-10-15 11:52:39 +02:00
Mike Fährmann
98fdcd4d72 [subscribestar] fix 'content' extraction (#7486)
and extract 'tags' metadata

Authored by: prowlguru

Co-authored-by: prowlguru <183935626+prowlguru@users.noreply.github.com>
2025-05-10 21:04:27 +02:00
Mike Fährmann
4807bc215c [subscribestar] extract 'title' metadata (#7219) 2025-03-22 09:46:08 +01:00
Mike Fährmann
d110dfd2da [tests] update extractor results 2025-01-27 17:15:32 +01:00
Mike Fährmann
671297a8cc [subscribestar] extend fix + add test
some attachments are inside an element with an additional class besides
'doc_preview', e.g. 'class="doc_preview for_post"'
2025-01-02 18:22:15 +01:00
Mike Fährmann
a833c244c8 add exported extractor results 2023-09-10 14:45:01 +02:00
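
Commits c8fc790028 and d57dc48dcd above mention using 'datetime.fromisoformat()' where possible and returning a datetime-compatible object for invalid values, with NullDatetime instances that inherit from 'datetime.datetime' without being exactly that class. The sketch below is one way such a pattern could look. It is not the project's actual code: the class body, the 'NONE' sentinel, and the 'parse_iso' helper are hypothetical names chosen for illustration.

```python
from datetime import datetime


class NullDatetime(datetime):
    """Hypothetical placeholder for a value that could not be parsed."""

    def __bool__(self):
        # Falsy, so callers can write `if dt:` to detect invalid values.
        return False


# Hypothetical sentinel instance; the placeholder date itself is arbitrary.
NONE = NullDatetime(1, 1, 1)


def parse_iso(value):
    """Parse an ISO 8601 string; return NONE instead of raising or
    handing back the raw 'str' value (assumed helper, not a real API)."""
    try:
        return datetime.fromisoformat(value)
    except (TypeError, ValueError):
        return NONE


ok = parse_iso("2025-10-20T09:30:05+02:00")
bad = parse_iso("not a date")

# Both results pass an isinstance() check ...
print(isinstance(ok, datetime), isinstance(bad, datetime))
# ... but only the real one is exactly 'datetime.datetime'.
print(type(ok) is datetime, type(bad) is datetime)
```

Under these assumptions, a test that checks 'isinstance(value, datetime)' cannot tell a real datetime from the placeholder, while an exact type comparison or a truthiness check can, which may be the kind of distinction the test change in d57dc48dcd is concerned with.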