ott_v1.png

Github

https://github.com/LeeHaEun1/OTT_Log_Analysis

Project Overview

Data Overview

Objectives

  1. Clarifying Field Definitions
  2. Data Preprocessing
  3. Multi-dimensional Visualization Design
  4. Extracting Business Insights

Data Preprocessing

table1.png

  1. As a preliminary check, I performed a simple missing value analysis, which revealed that two fields—Query Typed and Displayed Name—contained null values. These fields appeared to be semantically related to the Section field, which indicates how the result was generated. (Table 1)

table2.png

  1. The Section field was a categorical variable with three values: Prequery Results, Suggestion Results, and Title Results. However, since these categories were not clearly defined, I clarified their meanings through a combination of online research and logical inference before proceeding with the analysis. (Table 2)

table3.png

  1. According to the clarified definitions in Table 2, the three cases listed in Table 3 were logically inconsistent, so I removed the corresponding rows from the dataset.
    1. Case 1 and 2: The user did not type a query (Query Typed = null), but the result was categorized as generated based on user input (Section = Suggestion Results or Title Results).
    2. Case 3: There was no record of an autocomplete result (Displayed Name = null), yet the result was categorized as system-generated (Section = Suggestion Results).