Ecommerce Superset

This repository contains a collection of visualizations created with Apache Superset.

Featured image

The dashboards provide insights into various aspects of the Olist dataset, including product performance, sales trends, customer profiles, and more.

Overview

The visualizations in this repository explore different facets of the Olist dataset to offer valuable business insights. These dashboards include analysis on product performance by category, the influence of product weight on pricing, seller performance, payment methods, and more. Each visualization aims to identify trends and patterns that can inform business strategies and decision-making.

Column Descriptions

Each dataset used for the visualizations has specific columns that were leveraged for analysis:

How to Use This Data

  1. Clone the Repository: Use git clone https://github.com/yourusername/your-repository.git to get a local copy of the repository.
  2. Install Dependencies: Ensure you have Apache Superset installed to visualize the dashboards.
  3. Open Dashboards: Use Apache Superset to open and interact with the visualizations provided in the dashboards/ directory.

Exploring the Data

The dashboards offer various views and insights into the Olist dataset. Key visualizations include:

The visualizations reveal several trends:

Visualizations

  1. Presentation Order by Category
  2. The Influence of Product Weight and Dimensions on Price and Freight Value
  3. The Influence of Purchase Time on Order Status
  4. Seller Performance
  5. The Influence of the Number of Product Photos on Sales
  6. Comparison of Payment Methods to Payment Value
  7. Purchase Patterns Based on Time
  8. Customer Profile Based on Location and Purchase Frequency
  9. Sales Growth

This repository contains a collection of visualizations created with Apache Superset. The dashboards are designed to provide insights into various aspects of the Olist dataset, including product performance, sales trends, customer profiles, and more.

1. Presentation Order by Category

Objective: Identify the best and worst performing products by category. By linking data from olist_products_dataset and olist_order_items_dataset, you can analyze the most and least frequently purchased product categories. Description: From this graph, it can be concluded that household products such as bedding, bathroom, and dining equipment are very popular among consumers. Additionally, beauty and sports products also have quite high demand. This information can be a reference for businesses to allocate resources, such as inventory and promotions, to the most in-demand product categories. By understanding consumer preferences, businesses can increase sales and customer satisfaction.

2. The Influence of Product Weight and Dimensions on Price and Freight Value

Objective: Assess whether the weight and dimensions of products influence their price and freight value. Data from olist_products_dataset and olist_order_items_dataset can be used for this analysis. Description: This graph provides interesting insights into the relationship between product weight and dimensions and price and shipping costs. Each point on the graph represents a product category, with the size of the circle indicating the amount or value of a variable (e.g., total sales). The position of the point on the x- and y-axes indicates the overall weight and dimensions of the product.

From this graph, we can observe several things:

3. The Influence of Purchase Time on Order Status

Objective: Analyze whether there are specific time patterns when orders tend to experience delays or status changes. Data from olist_orders_dataset can be used to understand these trends. Description: This graph illustrates the relationship between purchase time and order status. There are three main metrics measured: total time to customer delivery, average delivery time, and average other time.

Overall, the graph shows that orders with a status of “delivered” have a shorter total time to customer delivery time than orders with a status of “canceled.” This suggests that faster order fulfillment tends to result in higher success rates. Additionally, the average other time for canceled orders is also higher, which may indicate additional bottlenecks or issues that are causing cancellations.

4. Seller Performance

Objective: Assess seller performance based on their location. By linking data from olist_sellers_dataset and olist_order_items_dataset, you can determine whether seller location affects their sales volume. Description: This graph provides a clear picture of the sales performance across cities in Brazil. Sao Paulo significantly dominates in terms of total orders and total sales, far above other cities. This shows that Sao Paulo is a very important and potential market for this business. Other cities such as Ibitinga, Curitiba, and Rio de Janeiro also contribute significantly to total sales, but are still far below Sao Paulo.

From this graph, we can draw several important conclusions:

5. The Influence of the Number of Product Photos on Sales

Objective: Measure whether the number of product photos affects sales levels. Data from olist_products_dataset and olist_order_items_dataset can be used for this analysis. Description: This pie chart provides a clear picture of the distribution of products based on the number of photos used. We can see that the majority of products (shown by the dark blue segment) have only 1 photo. This indicates that many sellers are not fully utilizing the potential of visuals to promote their products. In contrast, products with a higher number of photos (5 or more) have a much smaller proportion.

6. Comparison of Payment Methods to Payment Value

Objective: Analyze how payment methods (e.g., credit card vs. bank transfer) affect the average payment value. Data from olist_order_payments_dataset can be used for this insight. Description: From this graph, it can be concluded that credit cards are the most popular payment method and generate the largest transaction value. This indicates that most customers feel comfortable using credit cards to make transactions. Additionally, the low transaction value in the “not_defined” category shows the importance of ensuring that payment data is recorded correctly so that data analysis becomes more accurate. This information is very valuable for businesses to optimize payment strategies, such as offering special promotions for credit card users or providing more diverse payment options.

7. Purchase Patterns Based on Time

Objective: Analyze purchase patterns based on time of day or month to understand peak purchase periods. Data from olist_orders_dataset can be used to determine these patterns. Description: This graph shows a significant upward trend in the number of orders from October 2017 to a peak in early 2018. After that, the number of orders tends to be stable with little fluctuation. A fairly clear seasonal pattern is seen, with a significant increase occurring towards the end of the year (most likely due to the holiday season) and a relative decline at the beginning of the year.

Some insights that we can draw from this graph are:

8. Customer Profile Based on Location and Purchase Frequency

Objective: Identify customer characteristics such as location (city, state) and purchase frequency to determine the most active customer segments. Data from olist_customers_dataset and olist_order_items_dataset can provide these insights. Description: This visualization provides an interesting overview of customer profiles based on location and purchase frequency. Large cities such as Sao Paulo and Rio de Janeiro have a very significant number of customers, indicated by their larger land area. This indicates that these two cities are key markets for the business in question. Additionally, there is variation in purchase frequency across cities, indicated by the different colors or color intensity of each small box. Cities with lighter colors tend to have higher purchase frequencies.

Overall, this visualization can be used to identify cities with high growth potential, as well as to design more effective marketing strategies based on customer characteristics in each location. For example, businesses can allocate more marketing resources to cities with high purchase frequencies, or develop products and services that are more in line with customer preferences in each region. Additionally, this visualization can also help identify interesting purchasing patterns, such as certain seasons or special events that trigger increased sales in certain regions.

9. Sales Growth

Objective: Identify months with significant sales growth to plan better marketing and promotional strategies in the future. This analysis can also provide insights into strong or weak sales periods, helping in inventory planning and other business strategies. Description: Insights from the Sales Growth Chart:

Reporting

For any questions or further information regarding these visualizations, please open an issue or contact via LinkedIn or GitHub.

Dataset Source

The visualizations are based on the Olist dataset, available at Postgreesql.