Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

update dev #84

Merged
merged 13 commits into from
Aug 16, 2024
25 changes: 13 additions & 12 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -17,7 +17,7 @@


# STREAM
We present STREAM, a Simplified Topic Retrieval, Exploration, and Analysis Module for user-friendly topic modelling and especially subsequent interactive topic visualization and analysis.
We present STREAM, a Simplified Topic Retrieval, Exploration, and Analysis Module for user-friendly topic modelling and especially subsequent interactive topic visualization and analysis. Our paper can be found [here](https://aclanthology.org/2024.acl-short.41.pdf).

# Table of Contents
- [STREAM](#stream)
Expand Down Expand Up @@ -46,7 +46,7 @@ We present STREAM, a Simplified Topic Retrieval, Exploration, and Analysis Modul
- [Validation Criteria](#validation-criteria)
- [Submitting Your Contribution](#submitting-your-contribution)
- [Citation](#citation)
- [Paper 1 TBD](#paper-1-tbd)
- [STREAM](#stream-1)
- [Metrics and CEDC](#metrics-and-cedc)
- [TNTM](#tntm)
- [DCTE](#dcte)
Expand Down Expand Up @@ -80,6 +80,9 @@ Make additionally sure to download the necessary [nltk](https://www.nltk.org/) r

```python
import nltk
nltk.download('stopwords')
nltk.download('punkt')
nltk.download('wordnet')
nltk.download('averaged_perceptron_tagger')
```

Expand Down Expand Up @@ -604,18 +607,16 @@ If you want to include a new model where these guidelines are not approriate ple

If you use this project in your research, please consider citing:

### Paper 1 TBD
### STREAM

```bibtex
@article{your_paper_key1,
title={Your Paper Title},
author={Your Name and Co-Author's Name},
journal={Journal/Conference Name},
year={Year},
volume={Volume},
number={Number},
pages={Pages},
doi={link_to_doi}
@inproceedings{thielmann-etal-2024-stream,
title = {STREAM: Simplified Topic Retrieval, Exploration, and Analysis Module},
author = {Thielmann, Anton and Reuter, Arik and Weisser, Christoph and Kant, Gillian and Kumar, Manish and S{\"a}fken, Benjamin},
booktitle = {Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)},
year = {2024},
publisher = {Association for Computational Linguistics},
pages = {435--444},
}
```

Expand Down
6 changes: 5 additions & 1 deletion docs/installation.md
Original file line number Diff line number Diff line change
Expand Up @@ -19,9 +19,13 @@ pip install stream_topic

### Install the required dependencies:

Make sure you have installed the required dependencies before installing `stream_topic`. You can install the required dependencies using the following command:
Make sure you have installed the required dependencies before running any models from `stream_topic`. You can install the required dependencies using the following command:

```bash
import nltk

nltk.download('punkt')
nltk.download('wordnet')
nltk.download('stopwords')
nltk.download('averaged_perceptron_tagger')
```
21 changes: 11 additions & 10 deletions docs/landingpage.md
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
# STREAM
We present STREAM, a Simplified Topic Retrieval, Exploration, and Analysis Module for user-friendly topic modelling and especially subsequent interactive topic visualization and analysis.
We present STREAM, a Simplified Topic Retrieval, Exploration, and Analysis Module for user-friendly topic modelling and especially subsequent interactive topic visualization and analysis. Our paper can be found [here](https://aclanthology.org/2024.acl-short.41.pdf).

For better topic analysis, we implement multiple intruder-word based topic evaluation metrics. Additionally, we publicize multiple new datasets that can extend the so far very limited number of publicly available benchmark datasets in topic modeling. We integrate downstream interpretable analysis modules to enable users to easily analyse the created topics in downstream tasks together with additional tabular information.

Expand Down Expand Up @@ -28,6 +28,9 @@ Make additionally sure to download the necessary [nltk](https://www.nltk.org/) r

```python
import nltk
nltk.download('stopwords')
nltk.download('punkt')
nltk.download('wordnet')
nltk.download('averaged_perceptron_tagger')
```

Expand Down Expand Up @@ -416,15 +419,13 @@ If you use this project in your research, please consider citing:
### Paper 1 TBD

```bibtex
@article{your_paper_key1,
title={Your Paper Title},
author={Your Name and Co-Author's Name},
journal={Journal/Conference Name},
year={Year},
volume={Volume},
number={Number},
pages={Pages},
doi={link_to_doi}
@inproceedings{thielmann-etal-2024-stream,
title = {STREAM: Simplified Topic Retrieval, Exploration, and Analysis Module},
author = {Thielmann, Anton and Reuter, Arik and Weisser, Christoph and Kant, Gillian and Kumar, Manish and S{\"a}fken, Benjamin},
booktitle = {Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)},
year = {2024},
publisher = {Association for Computational Linguistics},
pages = {435--444},
}
```

Expand Down
Loading