top of page
Search
Git
Overview Git is a version control system that manages and tracks changes in projects. Unlike traditional systems, Git stores data as snapshots of the project rather than as a series of file changes. This approach allows Git to perform most operations locally, making it fast and efficient. It ensures data integrity through checksums and primarily adds data, making it difficult to lose information. Git operates with three main states: modified, staged, and committed, correspond

Rohan Roy
Jan 104 min read


Understanding Beam Search and Its Applications in Machine Learning
Machine learning models often face the challenge of generating sequences, such as sentences, code, or even music. Selecting the best possible sequence from many candidates is a complex task. Beam search is a powerful algorithm that helps models find high-quality sequences efficiently. This post explains what beam search is, how it works, and where it is used in machine learning. What Is Beam Search? Beam search is a heuristic search algorithm used to explore possible sequence

Rohan Roy
Jan 104 min read


Understanding Perceptron: The Building Block of Neural Networks
The perceptron is one of the simplest yet most important concepts in machine learning. It laid the foundation for modern neural networks and deep learning, which power many technologies we use today. Understanding the perceptron helps machine learning enthusiasts grasp how computers can learn from data and make decisions. This post explains what a perceptron is, how it works, and why it remains relevant. What Is a Perceptron? A perceptron is a type of artificial neuron, inspi

Rohan Roy
Dec 31, 20255 min read


Bare Metal vs Virtual Machines vs Containers Which is Right for Your Infrastructure
Choosing the right infrastructure for your applications can be challenging. With options like bare metal servers, virtual machines (VMs), and containers, each offers unique benefits and trade-offs. Understanding these differences helps you build a system that fits your performance needs, budget, and management preferences. This post breaks down the key features of bare metal, VMs, and containers. It explains how they work, their advantages and disadvantages, and when to use e

Rohan Roy
Dec 17, 20254 min read
Liquibase
What is Liquibase? Liquibase is an open-source "version control for your database." Just as Git tracks changes to your application code, Liquibase tracks changes to your database schema (tables, columns, indexes). In traditional development, managing database changes is often manual and risky: one developer runs a CREATE TABLE script locally but forgets to give it to the team, or the production database drifts out of sync with development. Liquibase solves this by treating

Rohan Roy
Dec 17, 20252 min read
Cassandra
Apache Cassandra is a distributed, NoSQL database engineered for large-scale, real-time applications with high availability ,...

Rohan Roy
Sep 3, 20252 min read
Object Class and its Equals & HashCode methods
Object Class (exists excerpts from docs.oracle.com) Object class is part of the java.lang package. Class Object is the root of the class...

Rohan Roy
Dec 8, 20222 min read
Anaconda Useful Tips
Create new environment run the following command in conda prompt: conda create -n envName python=3.7 Here "envName" is the name of the...

Rohan Roy
Jul 22, 20211 min read
Splunk Overview
(contains excerpts from docs.splunk.com) Splunk is a log processing platform which stores compressed logs and indexes them for faster...

Rohan Roy
Jul 10, 20213 min read
Indexes, indexers, and indexer clusters
Following is an excerpt from docs.splunk.com: The index is the repository for Splunk Enterprise data. Splunk Enterprise transforms...

Rohan Roy
Jul 10, 20214 min read
Spring Boot Essentials
Spring Boot has turned out to be the most sought after framework on java for application development. In this blog I tried to get a gist...

Rohan Roy
May 13, 20211 min read
EC2
Elastic Compute Cloud(EC2) is a web service that provides resizable compute capacity in the cloud. Virtual computing environments, known...

Rohan Roy
May 12, 20212 min read
Macie
AWS Macie is a security service which uses Machine Learning and NLP to discover, classify and protect sensitive data stored in S3. Uses...

Rohan Roy
May 11, 20211 min read
Athena
Athena is an interactive query service which is used to analyze and query data located in S3 using standard SQL. Athena is serverless,...

Rohan Roy
May 10, 20211 min read
Storage Gateway
Storage Gateway connects an on-premise software appliance with cloud-based storage to provide seamless and secure integration between an...

Rohan Roy
May 10, 20211 min read
Snowball
Snowball is a petabyte-scale data transport solution that uses secure appliances to transfer large amounts of data into and out of AWS....

Rohan Roy
May 10, 20211 min read
CloudFront
CloudFront is a CDN It's a global service, not specific to any region. Edge Location: location where content will be cached and is...

Rohan Roy
May 9, 20211 min read
S3 - Simple Storage Service
S3 is an object based storage. Files stored in s3 can be from 0 Byte to 5 TB. Storage is unlimited in S3. Files are created in buckets....

Rohan Roy
Jan 4, 20212 min read
Identity & Access Management
Identity and Access Management in AWS allows us to create 1. users 2. groups and 3. roles Users are physical users, who would login to...

Rohan Roy
Jan 4, 20211 min read
Identity Federation
Identity Federation Identity Federation refers to a system that allows the sharing of identity information across different domains or organizations. This enables users to access multiple services or applications using a single set of credentials, enhancing both security and user experience. Identity federation emerged towards the end of the first decade when Facebook and Google allowed various sites to utilize their services for authorization to access specific user data. B

Rohan Roy
Oct 21, 20192 min read
bottom of page