Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Getting connection refused #171

Open
stavikpetr opened this issue Dec 21, 2019 · 8 comments
Open

Getting connection refused #171

stavikpetr opened this issue Dec 21, 2019 · 8 comments

Comments

@stavikpetr
Copy link

Hello,

for about one month now, I am getting connection refused when using the tool. For example command pga list siva results in the following error message:

WARN[0001] could not check mod time in /home/stavik/.pga/siva/latest.index.csv.gz: Head http://pga.sourced.tech/csv/siva/latest.index.csv.gz: dial tcp 147.135.39.104:80: connect: connection refused
Error: could not open index file: could not copy to temporary file /home/stavik/.pga/siva/latest.index.csv.gz.tmp: Get http://pga.sourced.tech/csv/siva/latest.index.csv.gz: dial tcp 147.135.39.104:80: connect: connection refused

@anteos59
Copy link

anteos59 commented Jan 2, 2020

I'm getting almost the same error message:
Error: could not open index file: could not copy to temporary file C:\Users\Nutzer.pga\siva\latest.index.csv.gz.tmp: Get http://pga.sourced.tech/csv/siva/latest.index.csv.gz: dial tcp 147.135.39.104:80: connectex: No connection could be made because the target machine actively refused it.
Is there anything I could do? I would like to use PGA for my master thesis, so any help would be greatly appreciated.

@stavikpetr
Copy link
Author

anteos, I am in the same spot as you - I would also like to use PGA for my master thesis. Fortunately for me, I saved one dump of the tool just before it stopped working - it is the 173 MB dump of all repositories with more than 50 stars, which is basically the output of comman pga list siva. If you are interested in getting this dump, then contact me via email that you can find on my profile.

@vmarkovtsev
Copy link
Collaborator

Sorry for responding late (winter holidays). source{d} no longer exists, so the public datasets that had to be served from a dedicated server are all down. The tooling that depends on the server is essentially non-functional anymore. The files that are on Google Drive are still there and were additionally backed up, so they should continue to work. We were able to copy all the siva and UAST parquet files to Google Cloud Storage, however, we accidentally lost the PGA index file (CSV). If somebody has a copy, please send it to me.

The new company - Athenian - where some of the core devs migrated including me is sponsoring the GCS, however, public serving from there costs too much currently. I'll try to find an alternative way with @Guillemdb and @sergio-hcsoft who have recently bought the former data processing/ML cluster and have 80TB of free storage.

Now if you need a fraction of PGA, please send me the list of desired siva files, I will fetch them from GCS, package together and upload to Google Drive. Composing that list without an index is impossible though.

@Laura-lc
Copy link

Sorry for responding late (winter holidays). source{d} no longer exists, so the public datasets that had to be served from a dedicated server are all down. The tooling that depends on the server is essentially non-functional anymore. The files that are on Google Drive are still there and were additionally backed up, so they should continue to work. We were able to copy all the siva and UAST parquet files to Google Cloud Storage, however, we accidentally lost the PGA index file (CSV). If somebody has a copy, please send it to me.

The new company - Athenian - where some of the core devs migrated including me is sponsoring the GCS, however, public serving from there costs too much currently. I'll try to find an alternative way with @Guillemdb and @sergio-hcsoft who have recently bought the former data processing/ML cluster and have 80TB of free storage.

Now if you need a fraction of PGA, please send me the list of desired siva files, I will fetch them from GCS, package together and upload to Google Drive. Composing that list without an index is impossible though.

Hello vmarkovtsev,
I would like to get the metadata and source code (code diffs) of some of the projects in PGA for my master thesis. May I know how can I get those data? I emailed you before but got no response, may I know how can I contact you?
Thank you so much.

@giper45
Copy link

giper45 commented May 21, 2020

Hi, news about it ?

@darius-sas
Copy link

I am also interested in this. It's a pity that we cannot access this huge dataset :(

@dgrahn
Copy link

dgrahn commented Nov 17, 2020

@vmarkovtsev I would like a copy of any C/C++ repositories you have archived. I can provide a dropbox if you need it.

@peterzsj6
Copy link

Ops, so..... no more data?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

8 participants