Data & Analytics, Software development
Reading text from pdf using OCR Technique (Python) Why OCR (Optical Character Recognition)? We can also use the PyPDF2 python library to get text from PDF. But there is a major problem with this library. - It will not give you a good result if the data in the pdf is not structured. - You can lose some data. To overcome this...
Do you know how Ansible works? Well, before we walk you through how to setup dynamic inventory, here's some food for thoughts on Ansible Configuration Management System A user using configuration management system will often want to save inventory in a different software system. As described in inventory, a basic text-based system is...
We usually come across few scenarios where we need something to run on the remote servers for the sake of automation. I came across such a use case in AWS where I need to run fabric on a server to be triggered from AWS Lambda. This fabric script further does few tasks on other servers. Background/Scenario: While working on a spot...
Python is readable, dynamic, pleasant, fast, flexible and powerful language. It is a programming language which helps you work quickly and integrate systems more easily. LANGUAGE FEATURES Interpreted It executes instruction directly without any compilation like in C, C++, Java. Python converts the source code into bytecode...
Django is a high-level Python Web framework that can help you get your Python application or website off the ground. It includes simple development server for testing the code locally, but in the production environment, a more secure and powerful web server is required. uWSGI is an application server container that aims to provide...
We have been working on a scenario where we want to automate testing, build, deploy and revert in one Jenkins job. Currently, we are using separate Jenkins jobs for testing, deploying and reverting the code changes. We would be automating these tasks using AWS CodeDeploy with Jenkins. Jenkins can provide us the functionality to run...
In the previous blog we have discussed what is locust , why we use locust for load testing and how to implement it. So if you are wondering what locust is and how it works or for a better understanding, see locust .Now in this blog we are here to discuss about running locust in distributive way. For a single machine it is not enough...
In my previous blog post, I wrote about Cost Optimizations for the various AWS resources. The script consisted of six methods and the overall time required for the script to execute was roughly about 15-20 mins. While the script is running, it used to slow down my system. Also, there was a time consumption due to many reasons like...
As we know Python supports multiple approaches for concurrent programming with threads, sub-processes and some other ways which could help achieving solutions built on multiple CPUs or multi-core CPU. I tried implementing something similar on my existing use case for AWS Security Re-Check where I was running a check on...
This would be my first blog where I am using Python. I am excited to share what I did using Python. I wanted to update google sheet using bash shell scripting but did not found anything I can play around using bash. Later, I opted for Python as it provides "gspread" library which could be used to work on google sheet. ...
As we know AWS provides a good edge in terms of cost over the on-premises data center or in a co-location environment with its On Demand and Reserved pricing. As it is rightly said "Reducing the overall cost is a high priority" and it is true for any organization whether big or small. By using AWS we can lower the IT costs, compute...
Autoscaling is a service in AWS, which is used to launch or terminate an instance based on user-defined policies, health checks, and schedules. There are several ways to configure an auto-scaling group in AWS, here we are focusing on implementing it in python using AWS python module boto. Before Creating an Autoscaling Group ...