Narayani Patil

Hi, I'm Narayani Patil

$data_engineer && ml_enthusiast_

I transform complex data into meaningful insights and build scalable data pipelines.

About Me

Get to know more about my background and expertise

# About Narayani Patil
class DataEngineer:
    def __init__(self):
        self.name = "Narayani Patil"
        self.role = "Data Engineer & ML Enthusiast"
        self.location = "Boston, MA"
        self.education = ["MS Computer Engineering", "BE Information Technology"]
        self.skills = ["Python", "SQL", "Big Data", "ML/AI", "Cloud"]
        
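    def transform_data(self, raw_data):
        # Placeholder so the snippet runs: a real pipeline would clean and aggregate raw_data.
        meaningful_insights = raw_data
        return meaningful_insights  # Magic happens here!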

I'm a passionate Data Engineer and Computer Engineering graduate student at Northeastern University with expertise in designing and implementing data pipelines, ETL processes, and analytics solutions.

With professional experience at Natixis Investment Managers and Deloitte, I've developed skills in Python, SQL, big data technologies, and machine learning. I'm currently working as a Teaching Assistant for Machine Learning Operations at Northeastern University.

Education

Northeastern University, Boston, USA

Master of Science in Computer Engineering

Expected Aug 2025

University of Mumbai, Mumbai, India

Bachelor of Engineering in Information Technology

Aug 2017 – Jul 2021

Data Engineering
Machine Learning
Big Data
Python
SQL

Professional Experience

My journey in the field of data engineering and analytics

Teaching Assistant – Machine Learning Operations
Northeastern University, Boston, USA
Dec 2024 – Present
  • Working on a RAG-based chatbot leveraging LangChain, LLMs, vector databases, Airflow, and model pipelines.
  • Assisting 200+ students in understanding MLOps concepts, model versioning/deployment and operationalization.
Data Science Engineer Co-Op
Natixis Investment Managers (GROUPE BPCE), Boston, USA
Aug 2024 – Dec 2024
  • Developed and productionalized 5+ end-to-end distributed data pipelines from sales, marketing, and product datasets stored in PostgreSQL, enabling leadership to make data-driven decisions through Tableau dashboards.
  • Optimized the performance of existing pipelines by migrating them to dbt, improving processing speed by 30%.
  • Applied statistical scaling techniques to a dataset containing millions of records to identify data trends and patterns.
  • Built Python wrappers and scripts to automate management of multiple project dependencies & Git repositories.
Data Analyst
Deloitte, Mumbai, India
Sep 2021 – Jun 2023
  • Developed a pipeline that extracts user and project data from 300+ Dataiku projects across diverse Dataiku servers.
  • Improved data-driven decisions by 30% by analyzing user data with a time-series model and Power BI reports.
  • Modeled clinical trial healthcare datasets, generating critical insights that directly shaped client decision-making.
  • Led a team to restructure cluster storage, improving system efficiency by 40%.
  • Awarded SPOT Award for productionalizing 10+ Dataiku pipelines, ensuring accuracy and minimal downtime.

Technical Projects

Here are some of the projects I've worked on as a Data Engineer

NewsNest: Web Data Extraction and Management

Built and deployed a real-time pipeline on GCP that scrapes the latest news and videos using BeautifulSoup and Selenium. Architected the data flow with Airflow, vector databases, Kafka, OpenAI, Docker, and data warehouses.

Data Engineering with Snowpark Python
Cloud Data Engineering

Implemented data engineering solutions using Snowpark Python for efficient data processing and analytics. Leveraged Snowflake's capabilities for scalable data operations and transformations.

Intelligent Knowledge Retrieval & Q/A
AI-Powered Information Retrieval

Developed an intelligent application for knowledge retrieval and question answering tasks. Implemented advanced NLP techniques to enable efficient information extraction and accurate responses.

Technical Skills

My expertise across various technologies and domains

Programming & Databases

Python

Java

SQL

R

C

Scala

Golang

C++

Paper Publications

My research contributions to the academic community

Real Time Crowd Surveillance using Machine Learning
2nd GCAT IEEE Conference, Oct 2019
AI Powered Farming and Crop Recommendation System
2nd INCET IEEE Conference, May 2021
X-ray Image Analysis to Detect Pulmonary Disease with Deep CNN
IJPE Scopus Journal, Aug 2020
Performance Analysis of Activation Function on a Shallow Neural Network
JETIR Journal, Jun 2020