-
Notifications
You must be signed in to change notification settings - Fork 10
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Address I18N-ACTION-90 by adding text from scroll-to-text-fragment#233 #23
Conversation
This PR includes borrowing text from an example by @hsivonen, which I intend to replace before merging with a better-adapted version. In addition, some of the text or comments from WICG/scroll-to-text-fragment#233 are begin adapted into the prose of this document. **_Submitting as draft. Not ready for review._**
✅ Deploy Preview for w3c-string-search ready!
To edit notification comments on pull requests, go to your Netlify site configuration. |
This is a fairly significant refactoring of the introduction to section 2, plus a retitle of the equivalences section. I moved the Henri Sivonen example into an example and began refactoring it (but didn't finish it--it needs to be more illustrative) More to follow, but committing this for visibility/discussion.
@hsivonen for visibility. We borrowed your example and adapted it (you are credited in the document). Your comments would be very welcome. |
@@ -96,59 +96,111 @@ <h3>Terminology</h3> | |||
<p class="definition"><dfn data-lt="full text search|full-text search|full text searching">Full-Text Search</dfn> refers to searches that process the entire contents of the textual document or set of documents. Full-text queries perform linguistic searches against text data in full-text indexes by operating on words and phrases based on the rules of a particular language such as English or Japanese. Full-text queries can include simple words and phrases or multiple forms of a word or phrase.</p> | |||
<p>Frequently this means that a <a>full-text search</a> employs indexes and natural language processing. When you are using a search engine, you are using a form of full text search. Full text search often breaks natural language text into words or phrases (this is called <a>segmentation</a>) and may apply complex processing to get at the semantic "root" values of words (this is called <a>stemming</a>). These processes are sensitive to language, context, and many other aspects of textual variation.</p> | |||
|
|||
<p class="definition"><dfn data-lt="natural language processing|NLP">Natural Language Processing</dfn> (<abbr title="natural language processing">NLP</abbr>) refers to the domain of software designed to understand, process, and manipulate human languages (that is, <a>natural language</a>). This is a very wide ranging term. It can cover relatively simple problems, such as word tokenization, or more complex behaviors, such as deriving "meaning" from text, recognizing parts of speech, performing accurate translation, and much else.</p> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We can consider adding this to the glossary?
Since many aspects of internationalization are related to NLP (tense conversion, pluralization, singularization, parsing the structure of personal names, converting to comparative or superlative, lemmatization and stemming, and so on and so forth), not just string searching, we may also cite this term in other documents.
Co-authored-by: Fuqiao Xue <[email protected]>
- Convert Turkish examples to a table with code points - Make the en/de/fi examples use two columns - Add a hint for the reader - Add a title to the example
This PR borrows and adapts examples by Henri Sivonen. In the course of doing this change, the head matter of the document was revised extensively.