Document limitations of Bounter #37

aneesh-joshi · 2018-10-11T14:30:22Z

Credit goes to the people on this page.
I merely worded their concerns.

Fixes #36 .

CC: @menshikh-iv

merge Rare into aneesh

menshikh-iv · 2018-10-12T04:02:07Z

Thanks @aneesh-joshi

@piskvorky any comments?

piskvorky · 2018-10-12T05:22:50Z

README.md

+```python
+from bounter import bounter
+bounts = bounter(size_mb=1)
+bounts.update(str(i) for i in range(1000000))


Add another 0 (10000000), to make it clear this is not related to 1 MB.

piskvorky · 2018-10-12T05:23:18Z

README.md

+
+
+## When not to use Bounter?
+Beware, Bounter is only a probabilistic frequency counter and cannot be relied on for fine counting. (You can't expect a data structure with finite size to hold infinite data.)


fine => exact

piskvorky · 2018-10-12T05:25:23Z

README.md

+0
+```
+
+Please use `Counter` or `dict` when such fine counts matter. When they don't matter, like in most NLP applications with a huge corpora, Bounter is a very good alternative.


fine => exact.

piskvorky · 2018-10-12T05:27:07Z

README.md

+0
+```
+
+Please use `Counter` or `dict` when such fine counts matter. When they don't matter, like in most NLP applications with a huge corpora, Bounter is a very good alternative.


…like in most NLP and ML applications with huge datasets, …

piskvorky · 2018-10-12T05:28:59Z

README.rst

+When not to use Bounter?
+------------------------
+
+Beware, Bounter is only a probabilistic frequency counter and cannot be relied on for fine counting. (You can't expect a data structure with finite size to hold infinite data.)


Is this file auto-generated? Or why are we keeping two parallel READMEs?

Anyway, the same changes here please.

In this case, I propose to stay only .rst (because this used for PyPI too) and drop markdown version (to don't support 2 almost same versions)

I wonder if they're out of sync already… @aneesh-joshi can you check?

@piskvorky
I will check. The .rst is not auto generated. I manually edited it.

piskvorky · 2018-10-12T05:29:31Z

Good idea! Thanks.

menshikh-iv · 2018-10-15T03:53:39Z

@piskvorky README rst & md really not synced yet. I still think that we should stay only one version (create best .rst version & drop markdown version to avoid "de-sync" state).

piskvorky · 2018-10-15T05:51:13Z

I still think…

Did we already discuss this? To me a single version (.rst) sounds better too, but I don't remember the pros/cons. What were the arguments against it? Why are we keeping both .rst and .md?

menshikh-iv · 2018-10-15T06:01:52Z

I don't remember (I'm even not sure if we discussed this)

Maybe @isamaru remember?

Right now I see no pros to maintain both .md and .rst, I prefer .md, but PyPI can render only .rst, for this reason, single .rst should be our choice.

aneesh-joshi · 2018-10-15T12:46:41Z

@menshikh-iv @piskvorky
I've made the mentioned changes.
Waiting on your decision on whether to keep .rst or .md

Some questions:

Does .rst render properly on github?
How do I autogenerate the .rst? (I can't remember it and couldn't get the exact one through google)

piskvorky · 2018-10-15T14:28:45Z

README.md

 bounts['100']
 0
 ```

-Please use `Counter` or `dict` when such fine counts matter. When they don't matter, like in most NLP applications with a huge corpora, Bounter is a very good alternative.
+Please use `Counter` or `dict` when such exact counts matter. When they don't matter, like in most NLP and ML applications with a huge datasets, Bounter is a very good alternative.


a huge datasets => huge datasets.

Shoot! Missed that somehow.

piskvorky · 2018-10-15T14:28:56Z

README.rst

    bounts['100']
    0

-Please use ``Counter`` or ``dict`` when such fine counts matter. When they don't matter, like in most NLP applications with a huge corpora, Bounter is a very good alternative.
+Please use ``Counter`` or ``dict`` when such exact counts matter. When they don't matter, like in most NLP and ML applications with a huge datasets, Bounter is a very good alternative.


menshikh-iv · 2018-10-15T21:25:39Z

@aneesh-joshi

Does .rst render properly on github?

Yes

How do I autogenerate the .rst? (I can't remember it and couldn't get the exact one through google)

is http://pandoc.org/try/ works OK?

aneesh-joshi · 2018-10-16T00:58:58Z

@menshikh-iv
I autogenerated the .rst with
pandoc --from=markdown --to=rst --output=README.rst README.md

but the generated file has a huge diff compared with the original .rst

menshikh-iv · 2018-10-16T05:57:59Z

@aneesh-joshi that's expected: if you looking into commit history, you'll see a huge diff between this 2 files.

Here you need to do some manual work ("join" both to one file).

aneesh-joshi · 2018-10-23T13:32:54Z

@menshikh-iv
If you see the diff, there is a very large change. Editing it manually will be tedious.
Is there a version mismatch or config mismatch in the pandocs?
Can i just keep the old version of the rst?

menshikh-iv · 2018-10-25T03:52:49Z

@aneesh-joshi okay, but please raise an issue about syncing .rst and .md

aneesh-joshi · 2018-10-25T23:31:56Z

@menshikh-iv
No problem. I fixed it (I think?).

README.rst

aneesh-joshi · 2018-12-16T05:18:37Z

ping @menshikh-iv @piskvorky

menshikh-iv · 2019-01-17T03:14:51Z

Thanks @aneesh-joshi 🚀

aneesh-joshi added 8 commits March 7, 2018 01:01

fix incorrect image link

36fec75

update readme with blog link

c424982

Merge pull request #1 from RaRe-Technologies/master

5e76e21

merge Rare into aneesh

Merge branch 'master' of https://github.com/RaRe-Technologies/bounter

3dd82e8

update readme with when not to use Bounter

d20f017

remove gitignore changes

b2616c3

remove gitignore changes

63e019e

add Readme.rst

8405abc

piskvorky requested changes Oct 12, 2018

View reviewed changes

make minor changes in readme

63b246f

piskvorky requested changes Oct 15, 2018

View reviewed changes

aneesh-joshi added 2 commits October 15, 2018 11:38

Update README.md

c58afae

Update README.rst

f8c84bb

aneesh-joshi added 2 commits October 23, 2018 09:24

add auto generated rst

14aa962

resolve Merge conf

8014b83

aneesh-joshi added 2 commits October 25, 2018 19:27

reconcile .rst with autogenerated file

9dbd6b4

add extra line at end

7ad7484

piskvorky requested changes Oct 26, 2018

View reviewed changes

README.rst Outdated Show resolved Hide resolved

Update README.rst

4be5afb

menshikh-iv changed the title ~~Add limitations of Bounter. Addresses #36~~ Document limitations of Bounter Jan 17, 2019

menshikh-iv merged commit 98e6ba6 into piskvorky:master Jan 17, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Document limitations of Bounter #37

Document limitations of Bounter #37

aneesh-joshi commented Oct 11, 2018 •

edited by piskvorky

Loading

menshikh-iv commented Oct 12, 2018

piskvorky Oct 12, 2018

piskvorky Oct 12, 2018

piskvorky Oct 12, 2018

piskvorky Oct 12, 2018

piskvorky Oct 12, 2018

menshikh-iv Oct 12, 2018

piskvorky Oct 12, 2018

aneesh-joshi Oct 12, 2018

piskvorky commented Oct 12, 2018

menshikh-iv commented Oct 15, 2018

piskvorky commented Oct 15, 2018

menshikh-iv commented Oct 15, 2018

aneesh-joshi commented Oct 15, 2018

piskvorky Oct 15, 2018

aneesh-joshi Oct 15, 2018

piskvorky Oct 15, 2018

menshikh-iv commented Oct 15, 2018

aneesh-joshi commented Oct 16, 2018

menshikh-iv commented Oct 16, 2018

aneesh-joshi commented Oct 23, 2018

menshikh-iv commented Oct 25, 2018

aneesh-joshi commented Oct 25, 2018

aneesh-joshi commented Dec 16, 2018

menshikh-iv commented Jan 17, 2019



		## When not to use Bounter?
		Beware, Bounter is only a probabilistic frequency counter and cannot be relied on for fine counting. (You can't expect a data structure with finite size to hold infinite data.)

Document limitations of Bounter #37

Document limitations of Bounter #37

Conversation

aneesh-joshi commented Oct 11, 2018 • edited by piskvorky Loading

menshikh-iv commented Oct 12, 2018

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

piskvorky commented Oct 12, 2018

menshikh-iv commented Oct 15, 2018

piskvorky commented Oct 15, 2018

menshikh-iv commented Oct 15, 2018

aneesh-joshi commented Oct 15, 2018

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

menshikh-iv commented Oct 15, 2018

aneesh-joshi commented Oct 16, 2018

menshikh-iv commented Oct 16, 2018

aneesh-joshi commented Oct 23, 2018

menshikh-iv commented Oct 25, 2018

aneesh-joshi commented Oct 25, 2018

aneesh-joshi commented Dec 16, 2018

menshikh-iv commented Jan 17, 2019

aneesh-joshi commented Oct 11, 2018 •

edited by piskvorky

Loading