Question - Creating a Quick Data Pipeline
Let's create a quick EDA pipeline using pandas
Hi,
Suppose you're given the following dataframe:
Using this data, write a data processing pipeline that performs the following actions on the data:
Group the dataframe by a specified column and return the mean age of each group
Convert the column names to uppercase
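As a starting point, the two steps above could be sketched roughly as follows. This is a minimal sketch, not a reference solution. The sample rows and the `name` and `city` columns are hypothetical stand-ins, since the original dataframe is not reproduced here. It assumes only that an `age` column exists, as the task implies, and the helper names `mean_age_by` and `uppercase_columns` are made up for illustration.

```python
import pandas as pd

# Hypothetical sample data: the original dataframe is not reproduced here,
# so these column names and values are assumptions for illustration only.
df = pd.DataFrame(
    {
        "name": ["Ann", "Bob", "Cara", "Dan"],
        "city": ["Paris", "Paris", "Oslo", "Oslo"],
        "age": [30, 40, 25, 35],
    }
)


def mean_age_by(frame: pd.DataFrame, column: str) -> pd.DataFrame:
    """Group by `column` and return the mean age of each group."""
    return frame.groupby(column, as_index=False)["age"].mean()


def uppercase_columns(frame: pd.DataFrame) -> pd.DataFrame:
    """Return a copy of `frame` with every column name in uppercase."""
    return frame.rename(columns=str.upper)


# Chain the two steps with DataFrame.pipe so each step stays a small function.
result = df.pipe(mean_age_by, column="city").pipe(uppercase_columns)
print(result)
```

Chaining with `pipe` keeps each step independent, so the grouping column can be swapped without touching the renaming step.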
If you're using Python, you can build the dataframe using the code below:
Answer to the previous article's question:
with helper as (
    select candidate_id
    from candidates
    where lower(skill) in ('python', 'tableau', 'postgresql')
)
select candidate_id
from helper
group by candidate_id
having count(*) >= 3
order by candidate_id;