blog
Transactional Data Lakes Comparison
The interest in data lake has been increasing as more features are being added to the data lake frameworks and cloud services availability. In this article we will discuss the comparison between transactional data lakes - Apache Hudi, Delta Lake and Apache Iceberg and AWS Lake Formation.
Posted December 1, 2022 by Rohith ‐ 7 min read
Data Lake Layer Recommendations
Data lake allows use to store variety data at low cost. However, over the time it can become difficult to maintain as data grows. It can create data duplication, inefficient resource usage. In this article, we will understand the data lake layers and recommendations for effectively maintaining data in data lakes with examples.
Posted December 1, 2022 by Rohith ‐ 3 min read
zip and Unzip in Python
zip is used to compress the file. unzip is used to decompress the compressed file. In this article we will zip and unzip operation in python.
Posted October 18, 2022 by Rohith ‐ 1 min read
Move S3 Files Using Python Boto3
Using python boto3 module, we can move the s3 files from one bucket to another bucket or with in the same bucket. In this article we will explain how to achieve with an example.
Posted October 4, 2022 by Rohith ‐ 1 min read
Using Jinja With Python
In this article we will shows how to create templates in Python with Jinja module.
Posted September 30, 2022 by Rohith ‐ 4 min read