-
Notifications
You must be signed in to change notification settings - Fork 0
/
Google Maps Flagged UGC [Google SQL Interview Question]
57 lines (51 loc) · 1.81 KB
/
Google Maps Flagged UGC [Google SQL Interview Question]
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
with cte as (select m.content_id,m.place_id,m.content_tag,p.place_category
from maps_ugc_review m
left join place_info as p ON m.place_id=p.place_id)
,
cte2 as (select content_tag,place_category,count(place_category) as m
from cte
where content_tag= 'Off-topic'
group by place_category,content_tag
order by count(place_category) DESC)
,
cte3 as(select place_category,m,RANK() over(order by m desc) as t1
from cte2
)
select place_category
from cte3
where t1=1
----------------------
As a Data Analyst on the Google Maps User Generated Content team, you and your Product Manager are investigating user-generated content (UGC) – photos and reviews that independent users upload to Google Maps.
Write a query to determine which type of place (place_category) attracts the most UGC tagged as "off-topic". In the case of a tie, show the output in ascending order of place_category.
Assumptions:
Some places may not have any tags.
Each UGC upload with the "off-topic" tag will be counted separately.
place_info Table:
Column Name Type
place_id integer
place_name varchar
place_category varchar
place_info Example Input:
place_id place_name place_category
1 Baar Baar Restaurant
2 Rubirosa Restaurant
3 Mr. Purple Bar
4 La Caverna Bar
maps_ugc_review Table:
Column Name Type
content_id integer
place_id integer
content_tag varchar
maps_ugc_review Example Input:
content_id place_id content_tag
101 1 Off-topic
110 2 Misinformation
153 2 Off-topic
176 3 Harassment
190 3 Off-topic
Example Output:
off_topic_places
Restaurant
Explanation
The restaurants (Baar Baar and Rubirosa) have a total of has 2 UGC posts tagged as "off-topic". The bars only have 1. Restaurant is shown here because it's the type of place with the most UGC tagged as "off-topic".
The dataset you are querying against may have different input & output - this is just an example!