seiso.party support #1635

thatfuckingbird · 2021-06-17T21:57:17Z

A kemono.party fork/rewrite(?). Not sure how much of the kemono code could be reused.

Some info from their admin:

Main difference isn't in the UI, but in the code that runs the backend and the importer. It was almost completely rewritten (90%+) and the importer is much more reliable when it comes to embedded things and weird formats. This leads to higher quality content on the site.

Additionally, the storage that Seiso uses is a lot different from Kemono's storage which means that it won't buckle under load even when it gets to the amount of traffic that Kemono has. Images should always load regardless of traffic, even large uncached ones.

That's the gist of it.

mikf · 2021-06-18T15:09:04Z

The site doesn't appear to have a convenient API like kemono.party does, so not much of the current code can be reused, I think. Maybe some from the initial kemono commit that manually parsed HTML, but I doubt it.

Do you know where to find the site's code or any form of documentation for an eventual API?

thatfuckingbird · 2021-06-18T16:25:02Z

https://paywall.party/seiso/catalog.html is all the info I've found. There is a post from the admin that it might be made open source later but right now it is not. No mention of API, looks like we are out of luck for that.

Looking at the source, parsing the HTML of artist galleries shouldn't be too bad. The individual post pages aren't too bad either, looks like all the files we want have URLs beginning with cdn.seiso.party/files/.

Other than those, extracting the post title and text would be nice, especially that the post html can contain relevant links (e.g. to google drive or other file hosters).

mikf · 2021-06-26T16:56:58Z

Initial support got added in f74cf52.
It behaves more or less just like the kemono.party extractors as in:

it largely provides the same metadata fields
it uses the same filename/directory/archive format strings be default
it also needs cookies to get around DDOS-Guard (Kemono: 403 Forbidden #1370)

It also always provides username information without enabling a metadata option. This should probably be used instead of the user ID from user, since that doesn't reflect the real ID like it does on kemono.

thatfuckingbird · 2021-06-26T18:07:58Z

Thank you, appreciate your work a lot! Now I can scratch this off my TODO list.

mikf · 2021-06-29T19:06:21Z

Quick update:

files to cdn-2 servers now also get recognized (e4db1ba)
changed the default directory names to use usernames instead of IDs (daf821b)
added a warning when ddos-guard cookies are missing (344aab3)

mikf added nsfw site:support labels Jun 18, 2021

mikf added a commit that referenced this issue Jun 26, 2021

[seisoparty] add 'user' and 'post' extractors (#1635)

f74cf52

thatfuckingbird closed this as completed Jun 26, 2021

mikf added a commit that referenced this issue Jun 29, 2021

[seisoparty] also extract files hosted on 'cdn-2' servers (#1635)

e4db1ba

mikf added a commit that referenced this issue Jun 29, 2021

[seisoparty] use user names instead of IDs by default (#1635)

daf821b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

seiso.party support #1635

seiso.party support #1635

thatfuckingbird commented Jun 17, 2021

mikf commented Jun 18, 2021

thatfuckingbird commented Jun 18, 2021

mikf commented Jun 26, 2021 •

edited

Loading

thatfuckingbird commented Jun 26, 2021

mikf commented Jun 29, 2021

seiso.party support #1635

seiso.party support #1635

Comments

thatfuckingbird commented Jun 17, 2021

mikf commented Jun 18, 2021

thatfuckingbird commented Jun 18, 2021

mikf commented Jun 26, 2021 • edited Loading

thatfuckingbird commented Jun 26, 2021

mikf commented Jun 29, 2021

mikf commented Jun 26, 2021 •

edited

Loading