4GB CSV file turns into 8GB JuliaDB file

This [post](https://discourse.julialang.org/t/juliadb-questions-issues/24785/5) documents a problem with a very basic use case of JuliaDB. 

```
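using JuliaDB
# Load the CSV, using 200 rows for column type detection
acs = loadtable("psam_pusa.csv", type_detect_rows=200)
# Save the table to disk in JuliaDB's binary format
save(acs, "test")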
```
yields an 8GB file, although `psam_pusa.csv` is only 4GB. The inferred types are two `String`s, many `Int64`s and many `Union{Missing,Int64}`s.

```
acs = loadtable("C:\\Users\\Max\\Desktop\\psam_pusa.csv",
    colparsers = vcat(String, repeat([Union{Missing,Int64}], 95),
                      String, repeat([Union{Missing,Int64}], 30),
                      String, repeat([Union{Missing,Int64}], 158)))
```
yields a 2.5GB file.
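
For reference, the `colparsers` vector above can be built and checked on its own, without JuliaDB. The following is only a sketch in plain Julia; the variable name `parsers` is hypothetical and not part of the original report. It shows how many columns the vector describes and where the `String` columns fall.

```julia
# Rebuild the parser list from the call above (plain Julia, no JuliaDB needed).
# `parsers` is a hypothetical name used only for this illustration.
parsers = vcat(String, repeat([Union{Missing,Int64}], 95),
               String, repeat([Union{Missing,Int64}], 30),
               String, repeat([Union{Missing,Int64}], 158))

# Total number of columns described by the vector
println(length(parsers))

# Positions of the String columns; every other entry is Union{Missing,Int64}
println(findall(==(String), parsers))
```

With these counts, the vector describes 286 columns, three of which are parsed as `String`.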

Does the column type inference work properly, or is this a storage problem in JuliaDB?
I am on Julia 1.1.0, TextParse 0.9.1+, and JuliaDB 0.12.

4GB CSV file turns into 8GB JuliaDB file #137
