Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Hyphenated query terms are NOTted out in the Search API #322

Open
erikspears opened this issue Aug 28, 2023 · 1 comment
Open

Hyphenated query terms are NOTted out in the Search API #322

erikspears opened this issue Aug 28, 2023 · 1 comment
Labels

Comments

@erikspears
Copy link

Describe the bug
When searching a hyphenated terms such as non-invasive via the Search API, the term after the hyphen is not present in any results. That second term is treated as though it were a Boolean NOT term.

To Reproduce
Steps to reproduce the behavior:

  1. From the Search API, search the term non-invasive
  2. You will likely get zero results (on 28-August-2023 this was the case)
  3. From the API, search the term non invasive
  4. You will get many results, and most/all will contain the term non-invasive, WITH the hyphen.
  5. You can repeat these steps with any hyphenated query term that should retrieve results normally.

Expected behavior
That searches through the API for the term non-invasive would return results that contain that phrase.

Screenshots
Two screenshots attached, both from Postman searches of the arXiv Search API. The first is for the term non-invasive and shows zero results returned (see the opensearch:totalResults value).
arxiv - zero results

The second is for the term non invasive and returns over 35K results.
arxiv - 35k results

Desktop (please complete the following information):

  • OS: MacOS 13.5.1
  • Browser: Brave
  • Version: v1.57.53 (Aug 23, 2023)

Smartphone (please complete the following information):

  • Device: [e.g. iPhone6]
    arxiv - zero results

  • OS: [e.g. iOS8.1]

  • Browser [e.g. stock browser, safari]

  • Version [e.g. 22]

Additional context
Add any other context about the problem here.

@erikspears erikspears added the bug label Aug 28, 2023
@ianabc
Copy link

ianabc commented Nov 17, 2023

I just stumbled into the same bug. Advanced search for report numbers with CERN-TH returns 6910 results, the API with rn:CERN-TH returns just 2. Also, I think #292 is the same thing.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants