gpt-daily-arxiv

Description

gpt-daily-arxiv is a program that fetches papers using arxiv rss feeds and utilizes gpt to summarize the papers.

State Diagram

+---------------------+
| Arxiv RSS Feed      |
+---------------------+
          |
          v
+---------------------+
| Retrieve PDF        |
| from RSS Feed       |
+---------------------+
          |
          v
+---------------------+
| Convert PDF to Text |
+---------------------+
          |
          v
+---------------------+
| Ask GPT to Summarize|
| the Paper           |
+---------------------+
          |
          v
+---------------------+
| Write Data Record   |
| in MongoDB          |
+---------------------+

Usage

Ensure `OPENAPI_API_KEY` is setted. And if you are behind a proxy set environment variable `OPENAI_PROXY_URL` to your proxy server. Checkout this link:

To update RSS feeds

modify main.py below codes

arxiv_url_dict = {
    "Computer Vision": "https://arxiv.org/rss/cs.CV",
    "Computer Sicence": "https://arxiv.org/rss/cs",
    "Artificial Intelligence": "https://arxiv.org/rss/cs.AI",
    "Robotics": "https://arxiv.org/rss/cs.RO",
    "Software Engineering": "https://rss.arxiv.org/rss/cs.SE",
}

To run the project

Papers are download under ‘db’ folder

pip install -r requirements.txt
python3 main.py

# Also make sure mongodb is installed

Visualization

Since paper notes are stored in mongodb. I recommend using mongo-gui for visualization.

Further Works

[ ] dockerize this project
[ ] build frontend
[ ] support customize LLM

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.org

README.org

gpt-daily-arxiv

Description

State Diagram

Usage

To update RSS feeds

To run the project

Visualization

Further Works

Files

README.org

Latest commit

History

README.org

File metadata and controls

gpt-daily-arxiv

Description

State Diagram

Usage

To update RSS feeds

To run the project

Visualization

Further Works