-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathREADME.Rmd
executable file
·78 lines (53 loc) · 2.73 KB
/
README.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
---
output: github_document
---
```{r, echo = FALSE}
knitr::opts_chunk$set(
collapse = TRUE,
comment = "#>",
fig.path = "man/figures/README-"
)
```
# cleanTS
<!-- badges: start -->
[data:image/s3,"s3://crabby-images/cac10/cac1031678886123620bf476356808143c0604f8" alt="CRAN status"](https://CRAN.R-project.org/package=cleanTS)
[data:image/s3,"s3://crabby-images/39562/395623728c4fe63a24563d0644e4d92a3b7b0499" alt="R-CMD-check"](https://github.com/Mayur1009/cleanTS/actions/workflows/R-CMD-check.yaml)
[data:image/s3,"s3://crabby-images/d47ec/d47ec1157ea15a7fb737e0f94ec675a5c193c843" alt="Lifecycle: stable"](https://lifecycle.r-lib.org/articles/stages.html#stable)
<!-- badges: end -->
`cleanTS` package focuses on developing a tool for making the process of cleaning large datasets simple and efficient. Currently it solely focuses on data cleaning for univariate time series data. The package is integrated with already developed and deployed tools for missing value imputation. It also provides a way for visualizing data at different resolutions, allowing micro-scale visualization. The ultimate goal is the creation of a handy software tool that deals with the problems, processes, analysis and visualization of big data time series, with minimum human intervention.
* `cleanTS()` checks the data for missing and duplicate timestamps, performs missing value imputation and removes anomalies/outliers from the data.
* `animate_interval()` splits the data and generates an animated plot.
* `interact_plot()` is similar to `animate_interval()` but creates an interactive plot which provides relatively more control over the animation.
The package can also be used using a shiny application, available at [https://mayur1009.shinyapps.io/cleanTS/](https://mayur1009.shinyapps.io/cleanTS/).
Package Documentation can be found at [https://mayur1009.github.io/cleanTS/](https://mayur1009.github.io/cleanTS/).
This project was a part of [Google Summer of Code 2021](https://summerofcode.withgoogle.com/projects/#4626948166254592).
## Installation
```{r Installation, eval=F}
# Install release version from CRAN
install.packages("cleanTS")
# Install development version from GitHub
devtools::install_github("Mayur1009/cleanTS")
```
## Example
```{r Example}
library(cleanTS)
# Read sunspot.month dataset
data <- timetk::tk_tbl(sunspot.month)
print(data)
# Randomly insert missing values to simulate missing value imputation
set.seed(10)
ind <- sample(nrow(data), 100)
data$value[ind] <- NA
# Create `cleanTS` object
cts <- cleanTS(data, date_format = c("my"))
summary(cts)
# Cleaned Data
head(cts$clean_data)
# Genearate animated plot
a <- animate_interval(cts, interval = "10 year")
gen.animation(a, height = 700, width = 900)
```
```{r InteractPlot, eval = F}
# Generate interactive plot
interact_plot(cts, interval = "10 year")
```