2018 搜狐内容识别算法大赛
- Html filter
- Segmentation
- Extra-features
- Data Augementation
EDA
- Word_tfidf
- Char_tfidf
- Word2vec
Models
- NBSVM
- LGBM
- TextCNN
- RCNN
- Bi-LSTM
- Bi-GRU
Ensemble
- Word2vec dimentions
- Embedding layer
- 01-2 0-1 classification
- Keywords
- Extract text
- Text Recognition
- Text Classification
- Area Filtering (CTPN)
See more detail in my blog https://sanshibayuan.github.io/