Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[readcomiconline] gallery-dl no longer downloads #5866

Closed
godlike64 opened this issue Jul 18, 2024 · 5 comments
Closed

[readcomiconline] gallery-dl no longer downloads #5866

godlike64 opened this issue Jul 18, 2024 · 5 comments

Comments

@godlike64
Copy link

It would seem that readcomiconline.li no longer works. gallery-dl does not seem to do anything:

$ gallery-dl -v "https://readcomiconline.li/Comic/BRZRKR/Issue-11?id=209334"
[gallery-dl][debug] Version 1.27.2-dev
[gallery-dl][debug] Python 3.11.8 - Linux-6.6.21-gentoo-x86_64-AMD_Ryzen_7_2700_Eight-Core_Processor-with-glibc2.38
[gallery-dl][debug] requests 2.31.0 - urllib3 2.2.1
[gallery-dl][debug] Configuration Files []
[gallery-dl][debug] Starting DownloadJob for 'https://readcomiconline.li/Comic/BRZRKR/Issue-11?id=209334'
[readcomiconline][debug] Using ReadcomiconlineIssueExtractor for 'https://readcomiconline.li/Comic/BRZRKR/Issue-11?id=209334'
[urllib3.connectionpool][debug] Starting new HTTPS connection (1): readcomiconline.li:443
[urllib3.connectionpool][debug] https://readcomiconline.li:443 "GET /Comic/BRZRKR/Issue-11?id=209334&quality=hq HTTP/1.1" 200 None
$ 

Have tried using the --cookies-from-browser parameter with both firefox and chrome values. Cookies are recognized but gallery-dl behaves the same.

Tested with current master as of just now.

@mikf
Copy link
Owner

mikf commented Jul 19, 2024

They changed their URL obfuscation code by quite a bit and, even worse, today's method is different from yesterday's. Let's hope this is going to stop at some point and it doesn't keep changing every day.

@godlike64
Copy link
Author

That's a bummer... I'll keep an eye on this issue, let me know if any testing might be needed.

@godlike64
Copy link
Author

Not sure if it will help, but I spent some time trying to figure out what it's doing. So the final images seem to be hosted on blogspot and their URL never changes, what changes every time you load the page is the horridly obfuscated javascript bit from the page (starting at line 569 here), e.g.:

<script type="text/javascript">
        var _4ZlJegF = new Array();
        var _xGEvMod = new Array();
        var pth = '';
        
        pth = '5jtSFp0RUR0E0aZdTJpeHIzMmQ3WjE0SFLTleRzH3YEMFpMIezppMVRHeFRJMEZhaVZLLUplVDRzdHo5Y0tKMHFpb0I0UzV0bDd0aWZwS0ttMTVzNHdCeG9xT1BwYzJMVXVIVDBVd1lJVlVHVUc1VEZrNGY5TVk3SkNkdUM4aFJGbF9mdmJPdXFXV1Z1Rng0UDF4UXBTV0I2em1Va2lWcmE5MER0Z3JbZaVwUJVgw=s0?rhlupa=OTUuMjIuMTEzLjEyOS43LzIwLzIwMjQgNjo1Mzo0MyBBTQ&rnvuka=TW96aWxsYS81LjAgKFgxMTsgTGludXggeDg2XzY0OyBydjoxMjUuMCkgR2Vja28vMjAxMDAxMDEgRmlyZWZveC8xMjUuMA';

        
        pth = pth.replace(/b/g, 'pw_.g28x');
        pth = pth.replace(/h/g, 'd2pr.x_27');
        _4ZlJegF.push('https://2.bp.blogspot.com/5jtSFp0RUR0E0aZdTJpeHIzMmQ3WjE0SFLTleRzH3YEMFpMIezppMVRHeFRJMEZhaVZLLUplVDRzdHo5Y0tKMHFpb0I0UzV0bDd0aWZwS0ttMTVzNHdCeG9xT1BwYzJMVXVIVDBVd1lJVlVHVUc1VEZrNGY5TVk3SkNkdUM4aFJGbF9mdmJPdXFXV1Z1Rng0UDF4UXBTV0I2em1Va2lWcmE5MER0Z3JbZaVwUJVgw=s0');
        _xGEvMod.push(pth);
       [...]

Further down dinnerTime(_xGEvMod, ''); is issued, this appears to be always issued on the second array that is defined. dinnerTime is defined in the previous custom script and uses the beau() function which seems to come from
https://readcomiconline.li/Scripts/rguard.min.js?v=1.5.1 (again, even more horridly obfuscated, although its content seems to be static).

In this particular example (the first image when loading the comic from the link I posted earlier), the link to the actual image is https://2.bp.blogspot.com/u2ixr32d7Z14HGxTI0FaiVK-JeT4stz9cKJ0qioB4S5tl7tifpKKm15s4wBxoqOPpc2LUuHT0UwYIVUGUG5TFk4f9MY7JCduC8hRFl_fvbOuqWWVuFx4P1xQpSWB6zmUkiVra90Dtg=s0?rhlupa=OTUuMjIuMTEzLjEyOS43LzIwLzIwMjQgNjo0NDowMSBBTQ&rnvuka=TW96aWxsYS81LjAgKFgxMTsgTGludXggeDg2XzY0OyBydjoxMjUuMCkgR2Vja28vMjAxMDAxMDEgRmlyZWZveC8xMjUuMA

Some parts of this link are present in the obfuscated code, but of course not everything. I'm guessing those pth.replace() calls plus whatever the beau() function does is what eventually generates the final URL for the browser to open, but I've been unable to reproduce it on a js console.

mikf added a commit that referenced this issue Jul 23, 2024
@mikf
Copy link
Owner

mikf commented Jul 23, 2024

Should be fixed for now, until they change things again: 7b445ec

@godlike64
Copy link
Author

Yup, it's fixed! Many thanks! I was about to whip up some Selenium crawler of my own xD

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants