#1141 User Activity Past 30 Days I - Solution

Table: Activity

+---------------+---------+
| Column Name   | Type    |
+---------------+---------+
| user_id       | int     |
| session_id    | int     |
| activity_date | date    |
| activity_type | enum    |
+---------------+---------+
This table may have duplicate rows.
The activity_type column is an ENUM (category) of type ('open_session', 'end_session', 'scroll_down', 'send_message').
The table shows the user activities for a social media website. 
Note that each session belongs to exactly one user.

Write a solution to find the daily active user count for a period of 30 days ending 2019-07-27 inclusively. A user was active on someday if they made at least one activity on that day.

Return the result table in any order.

The result format is in the following example.

Example 1:

Input: 
Activity table:
+---------+------------+---------------+---------------+
| user_id | session_id | activity_date | activity_type |
+---------+------------+---------------+---------------+
| 1       | 1          | 2019-07-20    | open_session  |
| 1       | 1          | 2019-07-20    | scroll_down   |
| 1       | 1          | 2019-07-20    | end_session   |
| 2       | 4          | 2019-07-20    | open_session  |
| 2       | 4          | 2019-07-21    | send_message  |
| 2       | 4          | 2019-07-21    | end_session   |
| 3       | 2          | 2019-07-21    | open_session  |
| 3       | 2          | 2019-07-21    | send_message  |
| 3       | 2          | 2019-07-21    | end_session   |
| 4       | 3          | 2019-06-25    | open_session  |
| 4       | 3          | 2019-06-25    | end_session   |
+---------+------------+---------------+---------------+
Output: 
+------------+--------------+ 
| day        | active_users |
+------------+--------------+ 
| 2019-07-20 | 2            |
| 2019-07-21 | 2            |
+------------+--------------+ 
Explanation: Note that we do not care about days with zero active users.

Table: Activity

+---------------+---------+
| Column Name   | Type    |
+---------------+---------+
| user_id       | int     |
| session_id    | int     |
| activity_date | date    |
| activity_type | enum    |
+---------------+---------+
This table may have duplicate rows.
The activity_type column is an ENUM (category) of type ('open_session', 'end_session', 'scroll_down', 'send_message').
The table shows the user activities for a social media website. 
Note that each session belongs to exactly one user.

Write a solution to find the daily active user count for a period of 30 days ending 2019-07-27 inclusively. A user was active on someday if they made at least one activity on that day.

Return the result table in any order.

The result format is in the following example.

Example 1:

Input: 
Activity table:
+---------+------------+---------------+---------------+
| user_id | session_id | activity_date | activity_type |
+---------+------------+---------------+---------------+
| 1       | 1          | 2019-07-20    | open_session  |
| 1       | 1          | 2019-07-20    | scroll_down   |
| 1       | 1          | 2019-07-20    | end_session   |
| 2       | 4          | 2019-07-20    | open_session  |
| 2       | 4          | 2019-07-21    | send_message  |
| 2       | 4          | 2019-07-21    | end_session   |
| 3       | 2          | 2019-07-21    | open_session  |
| 3       | 2          | 2019-07-21    | send_message  |
| 3       | 2          | 2019-07-21    | end_session   |
| 4       | 3          | 2019-06-25    | open_session  |
| 4       | 3          | 2019-06-25    | end_session   |
+---------+------------+---------------+---------------+
Output: 
+------------+--------------+ 
| day        | active_users |
+------------+--------------+ 
| 2019-07-20 | 2            |
| 2019-07-21 | 2            |
+------------+--------------+ 
Explanation: Note that we do not care about days with zero active users.

In #1141 User Activity for the Past 30 Days I, the goal is to calculate the number of daily active users within the 30‑day window ending on a specified date. The key idea is to analyze activity records and determine how many distinct user_id values appear for each day.

Start by filtering the activity records to include only dates within the last 30 days of the reference date. After narrowing the dataset, group the records by activity_date. For each date group, count the number of unique users using COUNT(DISTINCT user_id) to avoid counting multiple activities from the same user more than once.

This grouped result gives the number of active users per day in the required window. Optionally, sort the output by date to present results chronologically. The approach mainly relies on SQL operations such as date filtering, grouping, and distinct counting, making it efficient and straightforward.

The overall performance depends on scanning the filtered activity records, which is typically O(n) in time with minimal additional space.

Approach	Time Complexity	Space Complexity
Date filtering + GROUP BY with COUNT(DISTINCT)	O(n)	O(d)

This approach uses SQL aggregation functions to filter activities in the last 30 days, group the activities by date, and count distinct user IDs for each date. We'll make use of the DATE_SUB function to get the starting date and apply GROUP BY and COUNT DISTINCT functions to get the active user count per day.

Time Complexity is approximately O(n) where n is the number of records in the table, because SQL has to scan all entries to filter and group them. Space Complexity is O(k) where k is the number of unique days with activities in the range, as that's the size of the result set.

This SQL statement selects dates and counts distinct users who were active on those dates. It filters the dates to the last 30 days ending on 2019-07-27 using the BETWEEN clause and DATE_SUB function. The GROUP BY activity_date groups the results by each day, and COUNT(DISTINCT user_id) counts the number of unique users active on each of those days.

This approach involves using a script to fetch and process data entries to count unique users per day. The script aggregates the data manually by iterating over each record, filtering by date, and maintaining a set of unique users for each activity date.

Time Complexity is O(n), where n is the number of activity records, as each record is processed separately. Space Complexity is O(d + u) where d is the number of days in the result and u is the total number of unique users aggregated in the set for all days.

Everyday Data Science

6:126,436 views

This Python code uses a defaultdict to map each activity_date to a set of unique user_ids, accumulating only those records where the date falls between 2019-06-28 and 2019-07-27. The use of sets ensures all user_ids are distinct for each day. Finally, it returns a list of tuples showing each day and the count of its active users.

1141. User Activity for the Past 30 Days I

Problem Statement

1141. User Activity for the Past 30 Days I

Problem Statement

Approach

Complexity

Video Solution Available

Solutions (3)

SQL Aggregation with Date Filtering

Explanation

Data Processing with Scripting

Video Solutions

LeetCode 1141 "User Activity Past 30 Days I" Meta Interview SQL Question with Detailed Explanation

User Activity for the Past 30 Days I | Leetcode 1141 | Crack SQL Interviews in 50 Qs #mysql

24. User Activity for the Past 30 Days I | SQL Interview Questions and Answers

LeetCode 1142 "User Activity for the Past 30 Days II" Netflix Interview SQL Question Explanation

1141. User Activity for the Past 30 Days I | Leetcode SQL Easy

MLV Prasad - LeetCode SQL [ EASY ] | 1141 | "User Activity for the Past 30 Days I" |

LeetCode 1141: User Activity for the Past 30 Days I

MLV Prasad - LeetCode SQL [ EASY ] | 1142 | "User Activity for the Past 30 Days II" |

1142. User Activity for the Past 30 Days II | Leetcode SQL Easy

Leetcode 38. Count and Say | Python Solution + Step-by-Step Explanation

Asked By Companies

Prepare for Interviews

Notes

Personal Notes

Similar Problems

Related Topics

Problem Stats

Practice on LeetCode

Frequently Asked Questions

Is User Activity for the Past 30 Days I asked in FAANG interviews?

What is the optimal approach for User Activity for the Past 30 Days I?

Why do we use COUNT(DISTINCT user_id) in User Activity for the Past 30 Days I?

What data structure or SQL concept is best for User Activity for the Past 30 Days I?

Explanation