GitHub - svjack/Sbert-ChineseExample: Sentence-Transformers Information Retrieval example on Chinese

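* 1 This project uses a customized, overridden es-pandas interface (with support for vector storage) to perform simple Elasticsearch operations through pandas.
* 2 The hard-sample extraction in try_sbert_neg_sampler.py (hard samples are those the model finds difficult to classify) comes from https://guzpenha.github.io/transformer_rankers/. Hard samples can also be generated with Elasticsearch; that functionality is defined in valid_cross_encoder_on_bi_encoder.py.
* 3 The cross_encoder training mentioned above requires checking in advance how much different sentences differ semantically; combining samples with similar semantics helps model training.
* 4 Added some tools for comparing multi-class results from Sentence-Transformers.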
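
The hard-sample idea in item 2 can be illustrated with a minimal sketch. This is not the code from try_sbert_neg_sampler.py or valid_cross_encoder_on_bi_encoder.py. It assumes that sentence embeddings are already available (toy 2-d vectors stand in for Sentence-Transformers output here), and the function names `cosine` and `mine_hard_negatives` are hypothetical. It treats as "hard" those non-matching candidates that score closest to the query.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def mine_hard_negatives(query_vec, candidate_vecs, positive_ids, k=2):
    """Return the ids of the k non-positive candidates most similar to the query.

    Hypothetical helper: candidate_vecs maps an id to its embedding,
    positive_ids holds the ids known to be correct matches.
    """
    scored = [
        (cosine(query_vec, vec), cid)
        for cid, vec in candidate_vecs.items()
        if cid not in positive_ids
    ]
    # Highest similarity first: these are the negatives a model confuses most easily.
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [cid for _, cid in scored[:k]]


# Toy embeddings (assumed data, for illustration only).
query = [1.0, 0.0]
candidates = {
    "pos": [0.9, 0.1],
    "near_miss": [0.8, 0.3],
    "related": [0.5, 0.5],
    "unrelated": [0.0, 1.0],
}
hard_negatives = mine_hard_negatives(query, candidates, positive_ids={"pos"}, k=2)
print(hard_negatives)
```

In practice the candidate pool would come from a retrieval step, such as an Elasticsearch query, rather than an in-memory dictionary. The selected negatives would then be paired with the query as training examples.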

Contributing

License

Distributed under the MIT License. See LICENSE for more information.

Contact

svjack - svjackbt@gmail.com ehangzhou@outlook.com

Project Link: https://github.com/svjack/Sbert-ChineseExample

Latest commit: svjack, "Update bi-encoder-batch.py", Feb 18, 2024 (1777be2), 9 commits

Name	Last commit message	Last commit date
.spyproject	upload	Jan 21, 2021
script	Update bi-encoder-batch.py	Feb 18, 2024
README.md	Create README.md	Mar 31, 2023
README_EN.md	Update README_EN.md	Mar 31, 2023
requirements.txt	upload	Jan 21, 2021
