Blog posts around Data Engineering

Using AI to build my first Android app: What worked and what didn’t

Initial Thoughts: Having spent over a decade working in software development and data engineering, I wondered: where is AI right now? Is it capable of eliminating the developer, or is there still some time? So I challenged myself to build an Android app, an unfamiliar area for me. While it intrigued me, there […]

by Abhishek Chauhan

May 25, 2025

Data Engineering

Automating Code Reviews Using OpenAI and GitHub

The State of Code Reviews in Today’s Development Landscape: In today’s fast-moving world of software development, AI has made remarkable progress. It can write code, debug errors, and even help design architectures. But let’s be honest, we’re not quite at a point where AI can take over the entire development process. Human developers are still […]

by Kedhar Praveen Natekar

May 5, 2025

Data Engineering

Snowflake Document AI: Unlocking Insights from Unstructured Data

Fun fact! Around 80%-90% of the world’s data is unstructured. I was shocked when I read this. Unstructured data includes images, emails, PDF files, social media posts, and other formats. Even though it is so widespread, 70% of this data is not being used to drive insights and analytics. As a Data Engineer, […]

by Akshay Vijay Girulkar

March 9, 2025

Data Engineering

Digital Engineering in Sales: Transforming Strategies for Modern Success

Introduction: In today’s era, businesses face the challenge of adapting to advanced technologies to stay ahead of their peers. Digital Engineering has emerged as a game changer, helping to integrate services like automation, data analytics, cloud computing, AI/ML, and IoT into engineering and business processes. Traditionally, it is associated with product development, but gradually […]

by Prabhpreet Kaur

January 13, 2025

Data Engineering

Mastering Data Modeling

As you progress in your journey from business intelligence (BI) development toward data engineering or analytics engineering, one of the core skills you need to focus on is data modeling. Data modeling is the foundation for any data architecture—whether you are building databases, designing ETL pipelines, or creating data warehouses. Without a solid understanding of […]

by Karishma Singhal

November 28, 2024

Data Engineering

Unlocking the Secrets to the Perfect Database Choice

Introduction: In today’s data-driven world, the choice of a database can significantly impact the performance, scalability, and maintainability of your application. With so many types of databases available, selecting the right one can be a daunting task. This guide will help you understand the key factors to consider when choosing a database and provide a […]

by Sindhura

October 12, 2024

Data Engineering

RSS Feed Parsing Using PySpark

Introduction: An RSS (Really Simple Syndication) feed is an online file that contains details about each piece of content a site has published. RSS feeds are a common way to distribute updates from websites and blogs. These feeds are often provided in XML format, and Python offers several tools to parse and extract information from […]

by Ashita Kumar

October 7, 2024

Data Engineering

Getting Started with Testing Scala Spark Applications Using ScalaTest

Testing is an essential aspect of software development, especially for big data applications where accuracy and performance are crucial. When working with Scala and Apache Spark, testing can get challenging due to the distributed nature of Spark and the complexity of data pipelines. Fortunately, ScalaTest provides a robust framework to write and manage your tests […]

by Rakesh Choudhary

September 30, 2024

Data Engineering

Configuring AWS Lambda as a Kafka Producer with SASL_SSL and Kerberos/GSSAPI for Secure Communication

Kafka is a distributed streaming platform designed for real-time data pipelines, stream processing, and data integration. AWS Lambda, on the other hand, is a serverless compute service that executes your code in response to events, managing the underlying compute resources for you. In organizations where Kafka plays a central role in streaming and data integration, […]

by Avinash Upreti

September 30, 2024

Blogs
