blog
date_trunc in Python Pandas
date_trunc is a date function that truncates a date or datetime value to the start of a given unit of time: year, decade, century, quarter, month, week, day, hour, minute, second, or millisecond. In this article, we truncate a date column using Python pandas.
Posted August 23, 2022 by Rohith ‐ 1 min read
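As a rough preview of the idea, here is a minimal sketch of date_trunc-style truncation in pandas. The DataFrame, the column name `ts`, and the sample timestamps are made up for illustration, and this is only one of several ways to do it: fixed-width units use `dt.floor`, while calendar units such as week, month, quarter, and year go through `dt.to_period`.

```python
import pandas as pd

# Hypothetical sample data; the column name "ts" is an assumption.
df = pd.DataFrame(
    {"ts": pd.to_datetime(["2022-08-23 14:37:55", "2022-11-05 09:12:01"])}
)

# Fixed-width units (day, hour, minute, ...) can be floored directly.
df["day"] = df["ts"].dt.floor("D")

# Calendar units vary in length, so convert to a period and take its start.
df["week"] = df["ts"].dt.to_period("W").dt.start_time      # weeks start on Monday
df["month"] = df["ts"].dt.to_period("M").dt.to_timestamp()
df["quarter"] = df["ts"].dt.to_period("Q").dt.to_timestamp()
df["year"] = df["ts"].dt.to_period("Y").dt.to_timestamp()

print(df)
```

Each new column holds the start of the unit that contains the original timestamp, which mirrors what date_trunc does in SQL.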
Coalesce in Python Pandas
In SQL, the coalesce function returns the first non-null value from a list of columns. In this article, we perform a coalesce operation on a Python pandas DataFrame.
Posted August 23, 2022 by Rohith ‐ 1 min read
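For a quick feel of what that looks like, the sketch below shows two common ways to coalesce columns in pandas. The column names (`mobile`, `home`, `work`) and the phone numbers are invented for the example; `combine_first` covers the two-column case, and back-filling across columns is one possible approach for several columns.

```python
import pandas as pd

# Hypothetical sample data; column names and values are assumptions.
df = pd.DataFrame(
    {
        "mobile": [None, "555-0101", None],
        "home": ["555-0200", "555-0201", None],
        "work": ["555-0300", None, None],
    }
)

# Two columns: take "mobile", fall back to "home" where "mobile" is null.
df["phone_two_cols"] = df["mobile"].combine_first(df["home"])

# Several columns: back-fill along each row, then keep the first column,
# which now holds the first non-null value in left-to-right order.
df["phone"] = df[["mobile", "home", "work"]].bfill(axis=1).iloc[:, 0]

print(df)
```

Rows where every candidate column is null stay null, just as COALESCE returns NULL when all of its arguments are NULL.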
Get Parquet Schema Using Python
Parquet is widely used in data transformations. Every Parquet file has a schema associated with it. Because Parquet is a binary format, we cannot read the data in a text editor. In this article, we use the pyarrow Python package to extract the Parquet schema.
Posted August 17, 2022 by Rohith ‐ 1 min read
Useful AWS S3 CLI Commands
The AWS S3 CLI is a useful tool for working with the S3 object store. This article lists frequently used AWS S3 CLI commands. It can serve as a quick reference for anyone looking for quick solutions.
Posted August 17, 2022 by Rohith ‐ 2 min read
AWS Available Regions
It is always handy to have a lookup table of the available AWS regions and their region codes. This article is a quick reference to the available AWS regions and their respective codes.
Posted August 17, 2022 by Rohith ‐ 1 min read
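As a small illustration of such a lookup table, the sketch below keeps a handful of region codes in a Python dictionary. It is deliberately partial, and the helper name `region_name` is hypothetical; AWS adds regions over time, so a complete list should come from the official documentation.

```python
# A partial lookup table of AWS region codes; not the full list of regions.
AWS_REGIONS = {
    "us-east-1": "US East (N. Virginia)",
    "us-east-2": "US East (Ohio)",
    "us-west-1": "US West (N. California)",
    "us-west-2": "US West (Oregon)",
    "eu-west-1": "Europe (Ireland)",
    "eu-central-1": "Europe (Frankfurt)",
    "ap-south-1": "Asia Pacific (Mumbai)",
    "ap-southeast-1": "Asia Pacific (Singapore)",
}


def region_name(code: str) -> str:
    """Return the display name for a region code (hypothetical helper)."""
    return AWS_REGIONS.get(code, "unknown region code")


print(region_name("eu-west-1"))
```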