Prince Yadav

Python

Automating ETL Workflows with Apache Airflow: An Universal Solution for Data Management

Introduction Vast about of data is being processed daily in various manners like financial transactions, customer interactions, sensors, research results, and so on. For instance, pharmaceutical companies produce millions of data points just from clinical trials alone. This huge amount of data has to be handled with accuracy and speed....

02-Sep-2024

Data & Analytics, Software development

Text Extraction from pdf using OCR (Optical Character Recognition ) in Python

Reading text from pdf using OCR Technique (Python) Why OCR (Optical Character Recognition)? We can also use the PyPDF2 python library to get text from PDF. But there is a major problem with this library. - It will not give you a good result if the data in the pdf is not structured. - You can lose some data. To overcome this...

29-Aug-2023

AWS, Technology

Creating Alexa skill using AWS lambda as backend

Amazon Alexa is a virtual assistant developed by Amazon, available on a range of devices including the Amazon Echo, Echo Dot, and Echo Show. One of the key features of Alexa is the ability to use “skills” to extend its functionality and allow users to interact with it in new ways. Creating a skill for Alexa is relatively simple and...

25-Mar-2023