Blog

Transactional Data Lakes Comparison

The interest in data lake has been increasing as more features are being added to the data lake frameworks and cloud services availability. In this article we will discuss the comparison between transactional data lakes - Apache Hudi, Delta Lake and Apache Iceberg and AWS Lake Formation.

Posted December 1, 2022 by Rohith ‐ 7 min read

data-lake differences transactional-data-lake blog

Data Lake Layer Recommendations

Data lake allows use to store variety data at low cost. However, over the time it can become difficult to maintain as data grows. It can create data duplication, inefficient resource usage. In this article, we will understand the data lake layers and recommendations for effectively maintaining data in data lakes with examples.

Posted December 1, 2022 by Rohith ‐ 3 min read

data-lake blog

zip and Unzip in Python

zip is used to compress the file. unzip is used to decompress the compressed file. In this article we will zip and unzip operation in python.

Posted October 18, 2022 by Rohith ‐ 1 min read

quick-references python gzip blog

Move S3 Files Using Python Boto3

Using python boto3 module, we can move the s3 files from one bucket to another bucket or with in the same bucket. In this article we will explain how to achieve with an example.

Posted October 4, 2022 by Rohith ‐ 1 min read

quick-references python boto3 s3 blog

Using Jinja With Python

In this article we will shows how to create templates in Python with Jinja module.

Posted September 30, 2022 by Rohith ‐ 4 min read

quick-references blog

Subscribe For More Content