For documentation on the approaches used, please refer to this link.
Progress
- Front End
- Home Page
- Authentication
- Single User
- Organization
- Comparison page (for both Org and Individuals)
- Multiple Sentence A and Multiple Sentence B (Comparison)
- Inserting One by One
- Inserting with file upload in specific format
- Company registration page
- Upload file
- API Help Page
- Back End
- DB
- DB Models
- DB Layer
- REST API layer
- For Interaction with front-end
- For Interaction with the outside world
- Comparison Engine Integration
- DB
- Comparison Engine
- Corpus Based Algorithm - Embedding generator
- Sent-Sim Algorithm - Embedding generator
- Comparison Methods
- One to One
- One to Many
- Many to Many
- Jupyter Notebook
- Make a notebook showing tests and results
- Download customer support sample files
- Try using a Quora-trained Sent-BERT model for better question comparison
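
As a rough illustration of the three comparison methods listed under Comparison Engine (One to One, One to Many, Many to Many), here is a minimal sketch. It assumes sentences have already been turned into embedding vectors by one of the embedding generators, and it assumes cosine similarity as the score; the function names `one_to_one`, `one_to_many`, and `many_to_many` are hypothetical and not taken from this project's code.

```python
import math
from typing import List, Sequence

# An embedding is assumed to be a plain sequence of floats.
Vector = Sequence[float]


def cosine_similarity(a: Vector, b: Vector) -> float:
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def one_to_one(a: Vector, b: Vector) -> float:
    """Score a single sentence A against a single sentence B."""
    return cosine_similarity(a, b)


def one_to_many(query: Vector, candidates: Sequence[Vector]) -> List[float]:
    """Score one sentence against every sentence in a list."""
    return [cosine_similarity(query, c) for c in candidates]


def many_to_many(group_a: Sequence[Vector], group_b: Sequence[Vector]) -> List[List[float]]:
    """Score every sentence in group A against every sentence in group B.

    Row i, column j holds the score of group_a[i] versus group_b[j].
    """
    return [one_to_many(a, group_b) for a in group_a]


if __name__ == "__main__":
    # Toy 2-dimensional vectors standing in for real sentence embeddings.
    group_a = [[1.0, 0.0], [0.0, 1.0]]
    group_b = [[2.0, 0.0], [1.0, 1.0], [0.0, 3.0]]
    for row in many_to_many(group_a, group_b):
        print([round(score, 3) for score in row])
```

Building the many-to-many case on top of the one-to-many case keeps the scoring logic in one place, so swapping in a different similarity measure would only touch `cosine_similarity`.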
- Li, Y., McLean, D., Bandar, Z.A., O'Shea, J.D. and Crockett, K., 2006. Sentence similarity based on semantic nets and corpus statistics. IEEE Transactions on Knowledge and Data Engineering, 18(8), pp.1138-1150. doi: 10.1109/TKDE.2006.130.
- Miller, G.A., 1995. WordNet: a lexical database for English. Communications of the ACM, 38(11), pp.39-41.
- Reimers, N. and Gurevych, I., 2019. Sentence-BERT: Sentence embeddings using Siamese BERT-networks. arXiv preprint arXiv:1908.10084.
- Devlin, J., Chang, M.W., Lee, K. and Toutanova, K., 2018. BERT: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805.