Allow hashing DataArrays with NA values #154

alyst · 2015-06-15T13:48:47Z

Without this DataFrames.nonunique() and friends do not work on frames with NA rows.

johnmyleswhite · 2015-06-15T14:29:25Z

This seems like a good strategy. I'm confused why the old approach wouldn't have worked, though -- it seems like it should just be too slow.

alyst · 2015-06-15T14:32:19Z

For me on v0.4 the old code was throwing InexactError or something like that when NAs were hashed.

johnmyleswhite · 2015-06-15T14:35:19Z

I'm going to hold off merging for a bit to give others a chance to review, but the CI failure seems unrelated and this seems good to go.

alyst · 2015-06-15T14:38:55Z

I've just thought it might be better to use findnext(BitVector) to skip NAs. I can resubmit an improved version.

johnmyleswhite · 2015-06-15T14:42:44Z

Sounds good. I would do some profiling to make sure that it's worth the effort; for the almost no-NA case I imagine it will be meaningfully slower to use findnext.

alyst · 2015-06-15T14:57:32Z

OK, I've replaced the PR with the findnext() version. Both approaches should do more or less the same bit magic, so it's hard to say what would happen in the average "dense" case, but for "sparse" case findnext() should be faster.

allow hashing DataArrays with NA values

d4fdfbc

alyst force-pushed the hash_na branch from a9d32c7 to d4fdfbc Compare June 15, 2015 14:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow hashing DataArrays with NA values #154

Allow hashing DataArrays with NA values #154

alyst commented Jun 15, 2015

johnmyleswhite commented Jun 15, 2015

alyst commented Jun 15, 2015

johnmyleswhite commented Jun 15, 2015

alyst commented Jun 15, 2015

johnmyleswhite commented Jun 15, 2015

alyst commented Jun 15, 2015

Allow hashing DataArrays with NA values #154

Are you sure you want to change the base?

Allow hashing DataArrays with NA values #154

Conversation

alyst commented Jun 15, 2015

johnmyleswhite commented Jun 15, 2015

alyst commented Jun 15, 2015

johnmyleswhite commented Jun 15, 2015

alyst commented Jun 15, 2015

johnmyleswhite commented Jun 15, 2015

alyst commented Jun 15, 2015