This is a statistical analysis on an ecommerce retail store that has customers in 38 countries. The dataset contains information on sales and customers for a year.Purchases were made in US dollars.
The dataset was provided by forage for their virtual experience program
Initial dataset contained 541909 rows and 8 columns. After data wrangling, there were 420738 rows and 12 columns,ready for analysis.
Some insights this projects seeks to derive are:
-Customers who brought in the most revenue
-The most and least performing Countries
-Sales Trend throughout the year
-Relationship between subgroups in the data
-Relationship between number of items purchased and revenue, with a hypothesis test to prove it's significance.
-Significant mean difference in sales between countries
Libraries used for analysis:
pandas
seaborn
matplotlib
numpy
statsmodels
wordcloud
My pleasure...!