About
Born and raised in Mumbai, India, my journey into the captivating realm of computers began in childhood. Fueled by an innate passion for exploration, I embarked on a lifelong quest to understand and engage with the ever-evolving world of technology. This early fascination has been a driving force, shaping my path and propelling me towards a fulfilling and dynamic career in the field.
MS | Data Science | Stevens Institute of Technology, Hoboken, US
Master's candidate in Data Science at Stevens Institute of Technology with 2 years of industry experience as a Senior Consultant in Cloud and Automation at LTIMindtree.
- Birthday: 6 May 1999
- Website: www.akshayparate.com
- Phone: +1 (551)331-3971
- City: Jersey City, USA
- Age: 24
- Degree: Master
- Email Id: aparate@stevens.edu
My professional journey took me to Saudi Arabia, where I dedicated a year to working with Al Rajhi Bank. Amidst the demands of my roles, I channel my passion for NLP into developing cutting-edge LLM applications.
Beyond the confines of the classroom, I have actively pursued my academic drive, acquiring advanced certification in blockchain technology from IIIT Bangalore. Versatility characterizes my skill set as I proficiently navigate through languages such as Java, Python, JavaScript, TypeScript, and Solidity in my projects.
Skills
Resume
Summary
Akshay Parate
Akshay Parate is a skilled data scientist and developer with advanced expertise in Python, PySpark, and machine learning frameworks such as TensorFlow and PyTorch. Currently pursuing an M.S. in Data Science at Stevens Institute of Technology, Akshay brings a robust educational foundation and hands-on experience in data preprocessing, ETL pipelines, and building AI-driven solutions.
- Central Ave, Jersey City, NJ, 07307
- (551) 331-3971
- Email Id: aparate@stevens.edu
- LinkedIn: akshay-parate-b49169171
- Github: akshayparate123
Education
Master of Science in Data Science
2023 - 2025
Stevens Institute of Technology, Hoboken, NJ
Relevant coursework: Applied Machine Learning, Natural Language Processing ,Deep Learning, Statistical Methods, Optimization, Introduction to Financial Risk Management, Big Data Technologies, Time Series Analysis
Bachelor of Technology in Electronics and Telecommunication Engineering
2018 - 2021
K.J. Somaiya College of Engineering, Mumbai, India
Relevant coursework: Applied Mathematics, Data Analysis, and Interpretation.
Diploma in Electronics and Communication Engineering
2015 - 2018
Dr. DY Patil Polytechnic, Mumbai, India
Certifications
Advanced Certification Programme in Blockchain Technology
2021 - 2022
International Institute of Information Technology, Bangalore, India
Relevant coursework: AWS, Linux, JavaScript, Blockchain Technology, NodeJs, Solidity
Professional Experience
Research Assistant
2023 - 2025
Hoboken, NJ, USA
- Assisted in the collection, cleaning, and preprocessing of raw large scale datasets, ensuring data quality and integrity
- Developed data crawler for continuous data scraping and PySpark implementation for fast data transformation.
- Designed workflows for transforming unstructured data into structured formats for data analysis using ETL tools.
- Collaborated with teams to develop retrieval augmented generation (RAG) pipeline to reduce hallucinations.
- Designed and developed image retrieval and generation pipeline for multimodal LLMs using Stable Diffusion model
- Supported the integration of processed datasets into ML models, enhancing predictive accuracy and performance
Senior Consultant
2021 - 2023
LTIMindtree, Riyadh, Saudi Arabia
- Implemented DevOps (CI/CD) to enhance the project's ability to deliver applications and services at high velocity.
- Utilized Python for in-depth data analysis of production server traffic, contributing to enhanced server responsiveness.
- Developed a machine learning algorithm for a decision system that dynamically scaled servers based on real-time loads.
- Frequently delivered code by introducing automation into the stages of app development using Python.
- Collaborated with cross-functional teams to ensure seamless integration of solutions.
- Implemented Linux and Ansible scripts for health checks of non-production servers.
- Automated routine tasks using Python for ease of EM team
Python IOT Intern
2019 - 2020
K.J. Somaiya College of Engineering, Mumbai, India
- Implemented Python automation scripts for IOT-based smart irrigation system
- Integrated Raspberry pi with multiple sensors.
Telecommunication Technology Intern
2018 - 2018
Jawaharlal Nehru Port Terminal, Mumbai, India
- Assist in monitoring and maintaining telecommunications networks to ensure optimal performance.
- Identify and troubleshoot network issues to minimize downtime and disruptions.
Projects
SKY : Personal AI Assistant (Award Winning Project)
- Designed and developed SKY Personal Assistant, an AI-driven virtual assistant focused on enhancing productivity through advanced features like question answering, information retrieval, and task automation.
- Integrated a Retrieval-Augmented Generation (RAG) pipeline utilizing knowledge graph learning, boosting result accuracy by 40% through contextual similarity methods and query rewriting.
- Built robust ETL pipelines using Apache Airflow for efficient data extraction, transformation, and loading, leveraging Hadoop for scalable storage and processing.
- Optimized data storage and retrieval solutions using Hive and relational databases for seamless query processing and analysis.
- Implemented Optical Character Recognition (OCR) pipelines to extract text from images, dividing images into smaller chunks for higher accuracy.
- Leveraged pre-trained BART model for text summarization, translation, and question answering, fine-tuning it for domain-specific applications.
- Used diverse datasets for training, including financial QA, medical QA, emotion recognition, and title generation, ensuring model adaptability across use cases.
- Designed RAG pipelines to retrieve, rank, and augment user queries with relevant data, improving LLM-generated response quality.
- Implemented a continuous learning system for SKY Personal Assistant, enabling personalized user interactions and adaptive responses based on user feedback.
- Planned future enhancements, including integration of computer vision, advanced data analysis, and expanded task automation capabilities.
- Backend Development: Python, PySpark, MySQL
- ETL and Data Processing: Apache Airflow, Hadoop, Hive
- AI and Machine Learning: BART, PyTorch, TensorFlow, RAG Pipeline
- OCR Implementation: Optical Character Recognition for text extraction
- Data Storage and Databases: Hive, Relational Databases
- Training and Datasets: Financial QA, Medical QA, Emotion Recognition, Title Generation
Analysis of Adverse Drug Effects using Big Data and Cloud Computing
- Conducted large-scale analysis on over 10 million healthcare data points using PySpark for efficient big data processing and analysis.
- Implemented advanced machine learning techniques such as XGBoost and ensemble methods to uncover critical patterns related to adverse drug effects.
- Enhanced drug safety and improved healthcare outcomes by identifying significant correlations and risk factors in the data.
- Preprocessed large datasets by cleaning, normalizing, and encoding categorical variables to prepare them for analysis using PySpark.
- Performed feature engineering to extract meaningful features that improved the predictive accuracy of machine learning models.
- Deployed the complete Spark application on Google Cloud Platform (GCP) for scalable and distributed computing.
- Utilized Amazon S3 for secure and scalable storage of large healthcare datasets, ensuring smooth integration with cloud-based processing workflows.
- Designed a pipeline for efficient data transfer between storage (S3) and computation clusters, optimizing runtime and cost-efficiency.
- Generated actionable insights through model interpretation and validation, providing meaningful contributions to healthcare decision-making processes.
- Leveraged cloud computing and big data technologies to demonstrate how scalable solutions can be used for real-world applications in drug safety analysis.
- Backend Development: Python, PySpark, MySQL
- Big Data Technologies: PySpark, Hadoop
- Machine Learning: XGBoost, Ensemble Methods
- Data Preprocessing: Data Cleaning, Feature Engineering, Encoding Categorical Variables
- Cloud Computing: Google Cloud Platform (GCP), Amazon S3
- ETL and Data Processing: Spark Pipelines, Distributed Computing
- Data Storage: S3 Bucket, Relational Databases
- Data Analysis: Pattern Identification, Predictive Modeling
- Scalable Solutions: Cluster Deployment, Parallel Processing
- Outcome Validation: Model Interpretation, Insight Generation
My Portfolio Application
- Spearheaded end-to-end development, overseeing the entire project lifecycle from conception to implementation.
- Implemented the widely acclaimed ABC strategy for investment recommendations, leveraging Python for data analysis and JavaScript for frontend enhancements.
- Conducted in-depth market trend analysis, providing valuable insights for strategic decision-making by users.
- Developed an intuitive and user-friendly stock chart, incorporating HTML, CSS, and JavaScript to display critical data such as targets and stop-loss levels, significantly enhancing the platform's usability.
- Successfully integrated features for comprehensive portfolio management, empowering users with a powerful tool to track and optimize their investments effectively.
- Utilized MySQL for database management, ensuring efficient and secure data storage and retrieval.
- Applied strong financial knowledge to enhance the platform's functionality and align it with the needs of investors.
- Demonstrated effective project management skills, ensuring timely delivery and meeting project objectives.
- Backend Development: Python
- Framework: Flask
- Frontend Development: HTML, CSS, jQuery
- Database Management: MySQL
- API Integration: Binance API, Fyers API
- Link to the project
Book Selling Web Application using Recommendation System
- Directed the end-to-end development of a dynamic Book Selling Application aimed at bridging the gap between college seniors and juniors, offering an affordable avenue for students to access required books.
- Implemented advanced machine learning technology along with Java and Python programming languages to analyze users' previous order history, delivering tailored book suggestions for an enhanced user experience.
- Led the complete development lifecycle of the Book Selling Application, ensuring successful project delivery.
- Implemented advanced machine learning technology using Python to analyze users' previous order history, providing personalized book suggestions and enhancing the overall user experience.
- Utilized Java, HTML, CSS, and JavaScript for full-stack development, creating an intuitive and user-friendly interface.
- Integrated Paytm for seamless and secure payment processing within the application, enhancing the platform's functionality.
- Employed MySQL for efficient data storage and retrieval, ensuring the smooth operation of the application.
- Implemented Firebase Storage for storing PDF versions of books, enabling users to access digital copies conveniently.
- Demonstrated effective project management skills, ensuring timely development milestones and project objectives were met.
- Backend Development: Java, Python
- Frontend Development: HTML, CSS, JavaScript
- Database Management: MySQL
- Data Analysis and Machine Learning: Python
- Payment Integration: Paytm
- File Storage: Firebase Storage
Automated Trading Bot
- Developed a fully automated crypto trading bot, eliminating the need for human intervention in trade execution.
- Implemented a specific trading strategy using JavaScript, leveraging the Binance API for seamless and automated trade execution.
- Utilized Node.js for the backend, ensuring scalability and efficiency in handling the bot's operations.
- Created a user-friendly front-end interface with HTML, CSS, and jQuery, allowing users to monitor real-time trade data, order history, and profits.
- Established a MySQL database to store and manage trading data efficiently.
- Demonstrated proficiency in deploying software, ensuring the successful implementation of the trading bot.
- Achieved outstanding profitability, with returns surpassing those of most traditional stock market funds by tenfold.
- The success of the bot attributed to its ability to analyze market trends, execute trades based on the defined strategy, and eliminate emotional decision-making and human errors.
- The project showcases efficient and reliable trading capabilities, positioning the bot as an excellent investment tool for cryptocurrency enthusiasts.
- Backend Development: Node.js, JavaScript
- Frontend Development: HTML, CSS, jQuery
- Database Management: MySQL
- API Integration: Binance API
- Software Deployment: Python
The Drug Counterfeiting Problem using Blockchain
- Led the development of a blockchain platform to combat drug counterfeiting, ensuring a secure and transparent drug supply chain.
- Utilized Hyperledger Fabric, a permissioned blockchain, to restrict access to authorized users, enhancing the confidentiality and integrity of the system.
- Implemented Solidity to create smart contracts enforcing rules and regulations within the drug supply chain.
- Divided the workflow into four units: Company Registration, Drug Registration, Transfer Drug, and View Lifecycle, providing a comprehensive solution for stakeholders.
- Enabled companies to register on the platform, verify their identity through a digital identity system, and register their drugs on the blockchain.
- Developed the Transfer Drug unit, ensuring secure drug transfers along the supply chain, maintaining origin and authenticity.
- Implemented the View Lifecycle unit, allowing stakeholders to track the entire lifecycle of a drug, from production to distribution.
- Applied Agile methodology for efficient project management, ensuring timely delivery and meeting project objectives.
- The project showcased the use of blockchain technology as a solution to drug counterfeiting, providing a secure, transparent, and efficient drug supply chain.
- Blockchain Technologies: Hyperledger Fabric, Solidity
- Programming Languages: JavaScript, Python (Node.js)
- Smart Contracts: Solidity
- Database Management: Hyperledger Fabric
- Project Management: Agile Methodology
Testimonials
I am proud to have gathered testimonials from the professionals I have had the privilege of working with throughout my career. These testimonials serve as a testament to the positive impact and contributions I have made in various projects and collaborations. The feedback from my colleagues, clients, and superiors not only validates my skills and work ethic but also reflects the strength of the professional relationships I have cultivated. These testimonials stand as a valuable resource, offering insights into my work style, collaborative spirit, and the positive outcomes achieved through our collective efforts.