Fictionally Irrelevant.
As a Data Engineer, I craft data pipelines and explore the potential of AI, leveraging my diverse software engineering background gained in startups and consulting. When not building tech stacks, you'll find me with a book in hand, supporting Chelsea FC, or building my next side project 🧑💻 🚀
Current Side Project - Nemo
Building a custom document question-answering system using LLMs, Langchain, and FastAPI to create a tool that can answer questions based on documents.
More Stories
Introducing Nemo Hirefolio
Organize & track your job search. HireFolio is a new app that takes the stress out of your job search and helps you stay organized and efficient every step of the way.
Side Projects 2023
Over the years, I have worked on a variety of personal projects in my free time, ranging from web apps to machine learning to Data Engineering to Web3. In this collection, you will find a selection of my most interesting and successful projects, each with its own unique challenges and accomplishments. I hope you enjoy exploring them as much as I enjoyed creating them!
Sky Computing: Unleashing the Cloud's Full Potential
Imagine seamless data movement, transparent job routing, and an interconnected cloud ecosystem.
DuckDB - Accelerating Data Analysis with Speed and Simplicity
Discover the power of DuckDB, the lightning-fast analytics database for Python and R. Explore its unique features, seamless integration, and exceptional performance in handling large datasets.
Streamlined Data Ingestion With Pandas
As a data engineer, one of the most common tasks is to import and process data from various sources. Efficient data ingestion is crucial for smooth data workflows and analysis.
Unlocking Efficiency with LinkedLists in Python
Removing items from a massive list? Deleting elements from large lists using Python's default lists can be surprisingly slow. We'll reveal a smoother path with LinkedLists.
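As a rough illustration of the idea in this teaser, here is a minimal sketch of a doubly linked list. The `Node` and `DoublyLinkedList` classes are hypothetical names written for this example, not code from the article. It assumes you already hold a reference to the node you want to delete; in that case removal only rewires two pointers, whereas deleting from the middle of a built-in list shifts every later element.

```python
from typing import Any, Iterator, Optional


class Node:
    """One element of the list, linked to its neighbours."""

    def __init__(self, value: Any) -> None:
        self.value = value
        self.prev: Optional["Node"] = None
        self.next: Optional["Node"] = None


class DoublyLinkedList:
    def __init__(self) -> None:
        self.head: Optional[Node] = None
        self.tail: Optional[Node] = None

    def append(self, value: Any) -> Node:
        """Add a value at the end and return its node for later removal."""
        node = Node(value)
        if self.tail is None:
            self.head = self.tail = node
        else:
            node.prev = self.tail
            self.tail.next = node
            self.tail = node
        return node

    def remove(self, node: Node) -> None:
        """Unlink a node in constant time by rewiring its neighbours."""
        if node.prev is None:
            self.head = node.next
        else:
            node.prev.next = node.next
        if node.next is None:
            self.tail = node.prev
        else:
            node.next.prev = node.prev
        node.prev = node.next = None

    def __iter__(self) -> Iterator[Any]:
        current = self.head
        while current is not None:
            yield current.value
            current = current.next


items = DoublyLinkedList()
nodes = [items.append(n) for n in range(5)]
items.remove(nodes[2])  # no shifting of the remaining elements
print(list(items))  # [0, 1, 3, 4]
```

Finding a node by value is still a linear scan, so the benefit only shows up when node references are kept around, for example in a dictionary keyed by ID.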
Get your own AI Generated MessiNFT 🐐
With Messi lifting the World Cup, I decided to create a limited edition AI-generated Messi NFT using Flow and Cadence. Experience the future of art with our AI-generated Messi NFTs on Flow. Created using Stable Diffusion and Dreambooth.
Introducing Nemo Thread
Are you tired of losing track of those amazing Twitter threads you come across? Do you wish there was an easier way to save and organize them? Introducing Nemo Threads – the solution to all your Twitter thread woes.
SignatureDAO
Mint your unique Signature NFT and join an incredible community of fellow signers who are dedicated to supporting the growth of the NFT, DeFi, and Metaverse industries.
Introducing Nemo Lofi
Nemo Lofi provides a soothing and relaxing soundtrack to help you focus and concentrate. Whether you're studying for an exam, working on a challenging project, or just need a break from the hustle and bustle of everyday life, the lofi music on Nemo will help to keep you calm and relaxed.
Data Engineer in 2022.
With the explosion of data generated by today's digital world, the demand for skilled data engineers is on the rise, and the field offers a wealth of exciting career opportunities.
Python: Implement Autocomplete feature using Trie.
Autocomplete is a common feature in modern search engines and applications, which provides suggestions and predictions as the user types in a query or keyword. In this article, we will explore how to implement an autocomplete feature using Python and a data structure called a Trie.
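A minimal sketch of the Trie approach this teaser describes might look like the following. The class names, the `suggest` method, and the sample words are assumptions made for illustration; the article's own implementation may differ.

```python
from typing import Dict, List


class TrieNode:
    def __init__(self) -> None:
        self.children: Dict[str, "TrieNode"] = {}
        self.is_word = False  # True when a stored word ends at this node


class Trie:
    def __init__(self) -> None:
        self.root = TrieNode()

    def insert(self, word: str) -> None:
        """Walk down the tree, creating one node per character."""
        node = self.root
        for ch in word:
            node = node.children.setdefault(ch, TrieNode())
        node.is_word = True

    def suggest(self, prefix: str) -> List[str]:
        """Return every stored word starting with prefix, sorted."""
        node = self.root
        for ch in prefix:
            if ch not in node.children:
                return []
            node = node.children[ch]
        # Depth-first walk of the subtree below the prefix node.
        results: List[str] = []
        stack = [(node, prefix)]
        while stack:
            current, text = stack.pop()
            if current.is_word:
                results.append(text)
            for ch, child in current.children.items():
                stack.append((child, text + ch))
        return sorted(results)


trie = Trie()
for word in ["data", "database", "date", "duck"]:
    trie.insert(word)

print(trie.suggest("dat"))  # ['data', 'database', 'date']
```

The lookup cost depends on the length of the prefix plus the size of the matching subtree, not on the total number of stored words.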
Data Engineer Roadmap 2021
A comprehensive guide to help you navigate the various skills and technologies needed to excel in the field. This roadmap aims to give a complete picture of the modern data engineering landscape and serve as a study guide for aspiring data engineers.
Multiprocessing with Pandas
Multiprocessing is a powerful tool for improving the performance of data analysis tasks, and Pandas is a popular Python library for working with structured data. By leveraging multiple CPU cores, multiprocessing lets you split Pandas data processing tasks across several processes, resulting in faster and more efficient computation.
Scikit-learn's GridSearchCV and Pipeline to build Machine Learning models
Scikit-learn is a popular and widely-used Python library for machine learning and data analysis. In this article, we will explore two of the key features of Scikit-learn, GridSearchCV and Pipeline, and how they can be used to build machine learning models.
Hyperparameter tuning in Machine Learning using scikit-learn
Parameters which define the model architecture are referred to as hyperparameters and thus this process of searching for the ideal model architecture is referred to as hyperparameter tuning.
Merge Data from different sources with Pandas
Pandas is a powerful tool when it comes to data manipulation with Python. More often than not, we have data stored in multiple files with some relation between them.
CI/CD Deployment using Deta and GitHub Actions.
CI/CD, or Continuous Integration/Continuous Deployment, is a software development practice that involves the automated and frequent integration, testing, and deployment of software changes. In this article, we will explore how to use Deta and GitHub Actions to create a CI/CD deployment pipeline, and some of the key benefits and considerations of doing so.
Python: Functional Programming with Map, Compose and Pipe.
Python is a versatile programming language that is well-suited to a variety of programming paradigms. One of these paradigms is functional programming, which focuses on the use of functions to transform and manipulate data. In this article, we will explore some of the key concepts of functional programming in Python, including map, compose, and pipe.
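To make the three concepts concrete, here is a small sketch. Python's standard library does not ship `compose` or `pipe`, so the two helpers below are hypothetical definitions built on `functools.reduce`, and `add_one` and `double` are toy functions chosen for the example.

```python
from functools import reduce
from typing import Any, Callable


def compose(*funcs: Callable[[Any], Any]) -> Callable[[Any], Any]:
    """Right-to-left composition: compose(f, g)(x) == f(g(x))."""
    return lambda x: reduce(lambda acc, fn: fn(acc), reversed(funcs), x)


def pipe(*funcs: Callable[[Any], Any]) -> Callable[[Any], Any]:
    """Left-to-right composition: pipe(f, g)(x) == g(f(x))."""
    return lambda x: reduce(lambda acc, fn: fn(acc), funcs, x)


def add_one(n: int) -> int:
    return n + 1


def double(n: int) -> int:
    return n * 2


double_then_add = compose(add_one, double)  # add_one(double(x))
add_then_double = pipe(add_one, double)     # double(add_one(x))

# map applies each composed function to every element.
print(list(map(double_then_add, [1, 2, 3])))  # [3, 5, 7]
print(list(map(add_then_double, [1, 2, 3])))  # [4, 6, 8]
```

The only difference between the two helpers is the order in which the functions are applied, which is why `pipe` often reads more naturally for step-by-step data transformations.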
Exploring Python's Specialized Dictionary
Python's built-in dict is a master of storing data, but did you know it has specialized siblings? Explore OrderedDict, defaultdict, ChainMap, and MappingProxyType.
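A quick sketch of the four types named above, using only the standard library. The variable names and sample data are made up for illustration.

```python
from collections import ChainMap, OrderedDict, defaultdict
from types import MappingProxyType

# defaultdict: missing keys get a default value instead of raising KeyError.
word_counts = defaultdict(int)
for word in ["a", "b", "a"]:
    word_counts[word] += 1

# OrderedDict: adds order-aware helpers such as move_to_end.
recent = OrderedDict(first=1, second=2, third=3)
recent.move_to_end("first")  # order is now second, third, first

# ChainMap: looks keys up in several dicts, first match wins.
defaults = {"host": "localhost", "port": 5432}
overrides = {"port": 6543}
settings = ChainMap(overrides, defaults)

# MappingProxyType: a read-only view over an existing dict.
read_only = MappingProxyType(defaults)
try:
    read_only["host"] = "example.com"
    write_blocked = False
except TypeError:
    write_blocked = True

print(dict(word_counts))                   # {'a': 2, 'b': 1}
print(list(recent))                        # ['second', 'third', 'first']
print(settings["port"], settings["host"])  # 6543 localhost
print(write_blocked)                       # True
```

Note that `MappingProxyType` is a view, not a copy: changes made to `defaults` directly remain visible through `read_only`.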
Data Engineering: Storage Layer
Breaking down Eventual Consistency, how it differs from Strong Consistency, when to use Eventual Consistency, and conflict resolution.
Ingest data into AWS.
Ingesting data into AWS is a common task for data engineers and analysts. In this article, we will explore some of the key methods for ingesting data into AWS, including batch and streaming options.
Data Engineering: Eventual Consistency
Consistency is a fundamental concept in distributed systems, and it refers to the state of data and information in a system at a given time. In this article, we will explore two of the key types of consistency in distributed systems, eventual consistency and strong consistency.
The Advent of MLOps.
Model deployment in machine learning has emerged as an intriguing field of research in recent years. You might have heard about MLOps, DevOps, GitOps, CI/CD, Kubernetes, Docker, and Kubeflow before. Let's take a look at how these can be used with MLOps and how MLOps differs from traditional DevOps.
Passion is overrated, focus on Career Capital.
Have you ever noticed people buying into motivational videos and 'be financially independent' advice, quitting their jobs to become full-time YouTubers, or launching a startup without investing in career capital? Let's dive deep and see how just banking on 'Follow Your Passion' might not be as shiny as it sounds.
Lessons learned from Joe Goldberg.
I've been binge-watching Netflix's psychological thriller, YOU. Having read all three novels and watched all three seasons, here's what you can learn from Joe, minus the killing part.
The 'Perfect Morning'.
Want to know how to make your morning perfect? Ever felt like you're always running to achieve so much but always come up short?
Five Parts of Every Business - Lessons to learn before building your product.
Every business consists of five key parts that work together to create a successful and sustainable enterprise. In this article, we will explore the five parts of every business, and some of the lessons that entrepreneurs can learn before building their product.
Everything you need to know to upload Images to IPFS
IPFS, or InterPlanetary File System, is a decentralized and distributed file system that allows users to store and share files on a peer-to-peer network. In this article, we will explore how to upload images to IPFS, and some of the key considerations and best practices for doing so.
Game of Thrones - NFT
Game of Thrones is a popular American fantasy television series that aired from 2011 to 2019. In this article, we will explore how the world of Game of Thrones is being brought to the world of non-fungible tokens (NFTs). Non-fungible tokens, or NFTs, are digital assets that are unique and cannot be replicated or exchanged for other assets on a one-to-one basis. NFTs are built on blockchain technology and they are increasingly being used to represent and trade digital collectibles, such as artwork, music, and other forms of media.
Create NFT in Solana without writing a single line of code in Rust.
A web app that lets users connect their wallet, click mint, and receive a random anime NFT from our collection in their wallet. Metaplex is the NFT standard on Solana and has created a set of standardized tools and libraries to create NFTs. Over $1 billion in sales have been made so far on NFTs that use the Metaplex standard.
Wave-at-me
Getting started with web3 Dapp using React and Solidity. Web3 Dapps are decentralized applications (Dapps) that are built on top of the web3 stack, which includes technologies such as blockchain, smart contracts, and distributed storage. In this article, we will explore how to get started with building web3 Dapps using React and Solidity, two popular and powerful technologies for building Dapps.
Loop - Content Creators 3.0
A platform that helps you support your favourite creators without draining your pockets. Loop is a new platform for content creators that aims to revolutionize the way that creators produce, distribute, and monetize their content. In this article, we will explore some of the key features and benefits of Loop, and how it is transforming the world of content creation.
Companies innovating using blockchain.
Blockchain technology has the potential to revolutionize a wide range of industries, from finance and healthcare to supply chain management and beyond. In this article, we will explore some of the innovative companies that are using blockchain to drive change and growth.
Introducing Perp Connect - Trade on Perpetual Protocol using a Non-Custodial Wallet.
The most secure non-custodial wallet. A non-custodial wallet is a decentralized type of wallet. Perp Connect does not transfer funds or store your keys, making it the most secure wallet app.
Use Brave if you're not using it already.
Three times faster than Chrome. Better privacy by default than Firefox. Uses 35% less battery on mobile. Unlike Chrome, Brave doesn't sell your data to the highest bidder.
Why you should use Server-Sent Events over Web Sockets and Long-Polling.
Server-Sent Events (SSE) is often overshadowed by its two big brothers — Web Sockets and Long-Polling. However, there are many practical use cases for using SSE.
What does threading in Python really mean?
Threads are terrible, and you should never use them, right? Unfortunately, the situation is not so simple. We need to weigh the benefits and risks of using threads, just like with any technology choice.
Always keep your word.
Keeping your word is an important principle that can have a profound impact on your personal and professional life. In this article, we will explore the importance of always keeping your word and the benefits that it can bring.
Automate your ML workflow with no code Machine Learning.
No-code machine learning is a type of machine learning that allows users to create and deploy ML models without the need for coding or data science expertise. No-code machine learning platforms, such as WeBase, provide users with a range of pre-built and customizable ML models and tools, which can be used to automate common steps in an ML workflow, such as data preparation, model training, and deployment.
Why you should package your Machine Learning Project.
Packaging machine learning (ML) projects is the process of bundling the code, data, and dependencies of an ML project into a single, self-contained unit. Packaging ML projects has a number of benefits, including making it easier to share, deploy, and maintain ML projects. In this article, we will explore some of the key reasons why you should package your ML project.
MLOps - Continuous Delivery for Machine Learning Models using Azure and GitHub Actions
MLOps is the practice of applying DevOps principles and practices to the development and deployment of machine learning (ML) models. One of the key challenges in MLOps is to create a continuous delivery pipeline for ML models, which can automate the process of building, testing, and deploying ML models.
Be productive with Nemo
Nemo is your little helper and companion no matter if you need to focus, tune out other noises or if you want to have a moment of calm and relax. Nemo is a digital platform that provides users with a customizable and personalized space for focus and productivity. This can be particularly useful for people who work in noisy or distracting environments, or who need to focus on specific tasks or projects.
MLOps for Edge Devices
MLOps is the practice of applying DevOps principles and practices to the development and deployment of machine learning (ML) models. MLOps for edge devices is a specific approach to MLOps that focuses on deploying and managing ML models on edge devices, such as sensors, cameras, and IoT devices.
Nemo - Your digital place for focus
Nemo is a new digital platform that aims to help people stay focused and productive in today's busy and distracting world. Nemo allows you to create your own personalized environment. Cancel background distraction and become more productive with Nemo.
Champions of Europe 2021
On May 29, 2021, Chelsea Football Club won the UEFA Champions League for the second time in their history. The Blues defeated Manchester City 1-0 in the final, thanks to a first-half goal from Chelsea midfielder Kai Havertz.
Data Orchestration with Airflow
Data orchestration is the process of managing and coordinating the flow of data between different systems and applications. Airflow is a popular open-source platform for data orchestration that allows users to define, schedule, and monitor data pipelines.
F.R.I.E.N.D.S — Data Analysis using Python
F.R.I.E.N.D.S is a popular American television sitcom that aired from 1994 to 2004. In this article, we will explore how to perform data analysis on the F.R.I.E.N.D.S dataset using Python.
React + Django for Prototyping Modern Web Application
React and Django are two popular web development technologies that can be used together to create modern and dynamic web applications. In this article, we will explore how to use React and Django together for prototyping web applications, including the benefits and potential challenges of this approach.
Effective ways to communicate
Good communication is a must-have skill. If you want to be an effective leader, the time and effort you put into improving your communication skills is one of the best investments you can make in your career.
Everything you need to know before starting with Machine Learning
Machine learning is a rapidly growing field that is transforming many industries and applications. In this article, we will explore some of the key concepts and considerations that you should know before starting with machine learning.
Introducing WeBase — Making deep learning simple
WeBase is a new deep learning platform that aims to make building and training deep learning models simpler and more accessible, without the need for machine learning knowledge or technical expertise.
A simple neural network for Image Classification
A very basic and easy-to-understand image classifier to get started with deep learning using images. We will use Dense layers to set a baseline accuracy before moving to a convolutional network and comparing the two results.
Convert your favorite shot into any famous artistic painting using Pluto.
Use AI to convert your favorite shot into any artistic painting. Ever since Neural Style Transfer was introduced in 2016 by Leon Gatys et al., the world has been fascinated by the creative power of deep learning algorithms. Following in the footsteps of the many people who were curious to build their own implementation, I designed Pluto to make it super easy for people to try my Neural Style Transfer.
When someone asks what is Deep Learning?
This post is the definition of deep learning in a nutshell. You may need some prior information or some basic idea of what a neural network is. This is not for complete beginners, but if you're familiar with the lingo, then this will help you to have a clearer definition of what deep learning is or does.
Getting started with CRUD API using Go
Creating, reading, updating, and deleting data (CRUD) are fundamental operations in many applications. In this article, we will explore how to create a CRUD API using Go, a popular and powerful programming language.
HTTP Requests with Go
In this blog post, we're going to see how we can make HTTP requests using Go. We cover all the basic HTTP requests like GET, POST, and others, along with how to use the JSON encoder and decoder.
Glorified Hello World with Go
A hitchhiker's guide to all the basics of Golang. This post is all you need to get started with the Go programming language. It covers all the basics in depth to help you get started with Go. Go is a popular and powerful programming language that is well-suited to a wide range of applications.
Demystifying GitHub API to Fetch the Top 3 Repositories by stars using Node.js
The GitHub API is a powerful tool for data analysts and engineers who want to retrieve and analyze data from the popular code-hosting platform. In this article, we will explore how to use the GitHub API to fetch the top 3 repositories by stars using Node.js.
The Rise of the Robo Advisors
I had the opportunity to work as a software engineer intern for a startup working on Robo advisor. The rise of robo advisors, or automated investment platforms, has been one of the most significant developments in the financial services industry in recent years. In this article, we will explore the rise of robo advisors, and some of the key factors that have contributed to their growth and success.
Change background audio of a video— The Pythonic Way
Editing the audio of a video is a common task for data analysts and engineers. In this article, we will explore how to change the background audio of a video using Python, one of the most popular and versatile programming languages.
Extract Screenshots from Video — The Pythonic Way
Extracting screenshots from videos is a common task for data analysts and engineers. In this article, we will explore how to extract screenshots from videos using Python, one of the most popular and versatile programming languages.
A Comprehensive Guide to Reading Different File Formats Using PySpark
In the world of data analytics, PySpark emerges as a powerful tool for handling diverse file formats, enabling seamless data ingestion and analysis. Its comprehensive read functions effortlessly transform raw data into analyzable DataFrames, unlocking the insights hidden within.
A Comprehensive Guide to Install Discourse for Development on Docker Using Vagrant
This article is for all the newbies or beginners trying to install Discourse for development on a Windows 10 Home machine with absolutely no knowledge of Vagrant, Docker, Ruby, or PostgreSQL.
Know any Dataset in 4 Lines of Python
Data has always been used to empower smarter decision-making. When solving any machine learning problem, the first thing a data scientist does is Exploratory Data Analysis (EDA). This is the first step towards solving any machine learning regression or classification problem.
Introduction to Agile for Software Engineers
Have you ever wondered how an agile team organizes its software development process? Having worked at a young startup as a software engineer intern, I learned a thing or two about agile.
Why Should I Care about Software Architecture?
Evolution is about the process of having a system that is fit for purpose and can survive the ever-changing environment in which it operates. The same goes for software development as more people are realizing the central role of software systems in our twenty-first-century human world.
Data Science - The Sexiest Job of the 21st Century
We have been hearing a lot about data science. Every company is hiring a Data Scientist. Harvard Business Review labeled the profession “the sexiest job of the 21st century”.