Question - Creating a Quick Data Pipeline

Let's create a quick EDA pipeline using pandas

Tushar Goel

Apr 15, 2023

Hi,

Suppose you're given the following dataframe:

Using this data, write a data processing pipeline to perform the following actions to the data:

Groups the dataframe by a specified column and returns the mean age of the group
Converts the column name to uppercase

If you're using Python, you can build the dataframe using the below code:

— Answer for the previous article:

with helper as (select candidate_id from candidates where lower(skill) in ('python','tableau','postgresql'))

select candidate_id from helper group by candidate_id having count(*)>=3 order by candidate_id

Interview ML

Question - Creating a Quick Data Pipeline

Let's create a quick EDA pipeline using pandas