Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Status of open science in Kenya: Data mining #1

Open
kipkurui opened this issue Aug 14, 2018 · 11 comments
Open

Status of open science in Kenya: Data mining #1

kipkurui opened this issue Aug 14, 2018 · 11 comments
Assignees
Labels
discussion needed help wanted Extra attention is needed

Comments

@kipkurui
Copy link
Contributor

kipkurui commented Aug 14, 2018

Explore the status of open science in Kenya through literature search and data mining. The idea is to analyze how open science tools have been used in research. For example, for a given period of time, we can explore how often published work in Kenya provides access to data and code. Where R has been used for data analysis, is the code provided, and through which means?

We could perform data analysis using R or Python. Tools like ContentMine could be used for this project. They have a tool for downloading papers: get papers.

Let's discuss.

@kipkurui kipkurui added help wanted Extra attention is needed discussion needed labels Aug 14, 2018
@kipkurui kipkurui changed the title Status of open science in Kenya Status of open science in Kenya: Data mining Aug 20, 2018
@kipkurui
Copy link
Contributor Author

We'll also explore using pubmed.mineR to perform text mining on Pubmed Abstracts. Used to extract these data by Geoffrey Siwo

@Shuyib
Copy link

Shuyib commented Aug 22, 2018

This is great! I'll go through the vignette. I was intending to search for keywords associated with open science research papers i.e take a subset of a couple of papers then we cluster those as a start as well as the technologies used + text mining the content of all the papers with an app i've made + searching for trends about our keywords across the internet.

@karegapauline
Copy link

karegapauline commented Aug 22, 2018 via email

@esohkevin
Copy link
Contributor

NCBI has also developed a tool (https://github.com/esohkevin/OpenScienceKEHackathon/blob/master/EDirect.md) that is just as robust in text mining. It can be used to access all NCBI databases

@kipkurui
Copy link
Contributor Author

Awesome, thanks @esohkevin and @Shuyib. I like how this is shaping up!

@ousodaniel
Copy link

ousodaniel commented Aug 24, 2018 via email

@ousodaniel
Copy link

ousodaniel commented Aug 24, 2018 via email

@kmut2030
Copy link

Good work, sorry i've been away but now I'm back ...

@kipkurui
Copy link
Contributor Author

Welcome Kelvin. Your presence has been missed.

@mgawe-cavin
Copy link

I registered with BMC-bioinformatics journal, I have been getting updates whenever any paper is published. There are a lot of codes and bioinformatics algorithms being published but majority of authors of these papers are not from kenya. A lot of R packages code can be pulled from this site (BMC-bioinformatics methods).

@Shuyib
Copy link

Shuyib commented Aug 30, 2018

Biopython can also be used for search however, it's more inclined for searching for information about specific things for instance genes, organisms. On the other hand, i found if your searching for abstracts from your PC then it's simple. I'll just point it out.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
discussion needed help wanted Extra attention is needed
Projects
None yet
Development

No branches or pull requests

7 participants