-
Notifications
You must be signed in to change notification settings - Fork 0
feat: ✨ add calendar scraper and dump endpoint #132
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I cannot discern where the issue might be in the code as I have little to no experience with webscraping, but the output for the finals start and end fields is identical.
I'm not sure where that issue could be sourced, but the output is wrong there. Otherwise looks great, thank you so much!
Update for posterity: It was, in fact, not "just a data integrity problem"; the algorithm was inherently ill-suited to handle the myriad edge cases with the formatting of the UCI Registrar's Quarterly Academic Calendar, so I rewrote the calendar library using a more sane algorithm that locates the entries based on the keywords, rather than assuming that their locations relative to the table's first entry will remain constant. This should also make it more extensible should we desire additional fields down the line. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Behavior is as expected. Thank you so much for this endpoint!
Summary
/v1/rest/calendar
with no params will now dump all terms in the database. The GraphQL queryallTermDates
accomplishes the same.socAvailable
to the response type, which indicates when the Schedule of Classes will become available for that term.Issues
Closes #111; closes #124.