Commit 20c7d0f

Merge pull request AnFreTh#84 from AnFreTh/main
update dev
2 parents 24e35b5 + aacab0c commit 20c7d0f

8 files changed, +502 -333 lines changed

README.md

+13 -12
@@ -17,7 +17,7 @@


 # STREAM
-We present STREAM, a Simplified Topic Retrieval, Exploration, and Analysis Module for user-friendly topic modelling and especially subsequent interactive topic visualization and analysis.
+We present STREAM, a Simplified Topic Retrieval, Exploration, and Analysis Module for user-friendly topic modelling and especially subsequent interactive topic visualization and analysis. Our paper can be found [here](https://aclanthology.org/2024.acl-short.41.pdf).

 # Table of Contents
 - [STREAM](#stream)
@@ -46,7 +46,7 @@ We present STREAM, a Simplified Topic Retrieval, Exploration, and Analysis Modul
 - [Validation Criteria](#validation-criteria)
 - [Submitting Your Contribution](#submitting-your-contribution)
 - [Citation](#citation)
-- [Paper 1 TBD](#paper-1-tbd)
+- [STREAM](#stream-1)
 - [Metrics and CEDC](#metrics-and-cedc)
 - [TNTM](#tntm)
 - [DCTE](#dcte)
@@ -80,6 +80,9 @@ Make additionally sure to download the necessary [nltk](https://www.nltk.org/) r

 ```python
 import nltk
+nltk.download('stopwords')
+nltk.download('punkt')
+nltk.download('wordnet')
 nltk.download('averaged_perceptron_tagger')
 ```

@@ -604,18 +607,16 @@ If you want to include a new model where these guidelines are not approriate ple

 If you use this project in your research, please consider citing:

-### Paper 1 TBD
+### STREAM

 ```bibtex
-@article{your_paper_key1,
-title={Your Paper Title},
-author={Your Name and Co-Author's Name},
-journal={Journal/Conference Name},
-year={Year},
-volume={Volume},
-number={Number},
-pages={Pages},
-doi={link_to_doi}
+@inproceedings{thielmann-etal-2024-stream,
+title = {STREAM: Simplified Topic Retrieval, Exploration, and Analysis Module},
+author = {Thielmann, Anton and Reuter, Arik and Weisser, Christoph and Kant, Gillian and Kumar, Manish and S{\"a}fken, Benjamin},
+booktitle = {Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)},
+year = {2024},
+publisher = {Association for Computational Linguistics},
+pages = {435--444},
 }
 ```

docs/installation.md

+5 -1
@@ -19,9 +19,13 @@ pip install stream_topic

 ### Install the required dependencies:

-Make sure you have installed the required dependencies before installing `stream_topic`. You can install the required dependencies using the following command:
+Make sure you have installed the required dependencies before running any models from `stream_topic`. You can install the required dependencies using the following command:

 ```bash
 import nltk
+
+nltk.download('punkt')
+nltk.download('wordnet')
+nltk.download('stopwords')
 nltk.download('averaged_perceptron_tagger')
 ```
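
The four `nltk.download(...)` calls added in this commit can also be wrapped in a helper that fetches only the resources that are still missing. The following is a minimal sketch, not part of the commit: the helper name `ensure_nltk_resources` and the `NLTK_RESOURCES` mapping are hypothetical, and the lookup paths assume NLTK's usual data layout (`corpora/`, `tokenizers/`, `taggers/`), which may differ between NLTK versions.

```python
from typing import Callable, Dict, List, Optional

# Resource name -> lookup path for nltk.data.find.
# The names mirror the downloads in the diff above; the paths are an
# assumption about NLTK's data layout and may need adjusting.
NLTK_RESOURCES: Dict[str, str] = {
    "stopwords": "corpora/stopwords",
    "punkt": "tokenizers/punkt",
    "wordnet": "corpora/wordnet",
    "averaged_perceptron_tagger": "taggers/averaged_perceptron_tagger",
}


def ensure_nltk_resources(
    resources: Dict[str, str] = NLTK_RESOURCES,
    find: Optional[Callable[[str], object]] = None,
    download: Optional[Callable[[str], object]] = None,
) -> List[str]:
    """Download every resource that is not installed yet.

    Returns the names that had to be downloaded, in mapping order.
    `find` and `download` default to nltk.data.find and nltk.download;
    they are parameters only so the helper can be exercised with stubs.
    """
    if find is None or download is None:
        import nltk  # imported lazily so stubbed calls do not need NLTK

        find = find or nltk.data.find
        download = download or nltk.download

    downloaded: List[str] = []
    for name, path in resources.items():
        try:
            find(path)  # raises LookupError when the resource is missing
        except LookupError:
            download(name)
            downloaded.append(name)
    return downloaded
```

Calling `ensure_nltk_resources()` with no arguments uses NLTK itself, so a second call on the same machine should find everything locally and download nothing.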

docs/landingpage.md

+11 -10
@@ -1,5 +1,5 @@
 # STREAM
-We present STREAM, a Simplified Topic Retrieval, Exploration, and Analysis Module for user-friendly topic modelling and especially subsequent interactive topic visualization and analysis.
+We present STREAM, a Simplified Topic Retrieval, Exploration, and Analysis Module for user-friendly topic modelling and especially subsequent interactive topic visualization and analysis. Our paper can be found [here](https://aclanthology.org/2024.acl-short.41.pdf).

 For better topic analysis, we implement multiple intruder-word based topic evaluation metrics. Additionally, we publicize multiple new datasets that can extend the so far very limited number of publicly available benchmark datasets in topic modeling. We integrate downstream interpretable analysis modules to enable users to easily analyse the created topics in downstream tasks together with additional tabular information.

@@ -28,6 +28,9 @@ Make additionally sure to download the necessary [nltk](https://www.nltk.org/) r

 ```python
 import nltk
+nltk.download('stopwords')
+nltk.download('punkt')
+nltk.download('wordnet')
 nltk.download('averaged_perceptron_tagger')
 ```

@@ -416,15 +419,13 @@ If you use this project in your research, please consider citing:
 ### Paper 1 TBD

 ```bibtex
-@article{your_paper_key1,
-title={Your Paper Title},
-author={Your Name and Co-Author's Name},
-journal={Journal/Conference Name},
-year={Year},
-volume={Volume},
-number={Number},
-pages={Pages},
-doi={link_to_doi}
+@inproceedings{thielmann-etal-2024-stream,
+title = {STREAM: Simplified Topic Retrieval, Exploration, and Analysis Module},
+author = {Thielmann, Anton and Reuter, Arik and Weisser, Christoph and Kant, Gillian and Kumar, Manish and S{\"a}fken, Benjamin},
+booktitle = {Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)},
+year = {2024},
+publisher = {Association for Computational Linguistics},
+pages = {435--444},
 }
 ```
