logo

Projects

A list of projects I've worked on recently.

VM-based Deduplication Solution Migration to Azure Batch

VM-based Deduplication Solution Migration to Azure Batch

Successfully led the transition of a VM-based deduplication solution to Azure Batch, significantly enhancing efficiency and scalability. This initiative involved a comprehensive overhaul of the existing system, adapting it to Azure Batch's advanced cloud capabilities. I was responsible for the entire migration process, ensuring seamless execution and optimal performance. This strategic move resulted in substantial cost savings, reducing the client's annual expenses by approximately $8,000.

Learn more →
Training of Masked Large Language Models for Indigenous Bantu Languages

Training of Masked Large Language Models for Indigenous Bantu Languages

Pioneered the creation of cutting-edge Masked Language Models specifically tailored for Tshivenda and other indigenous Bantu languages.

Learn more →
Automated Regulatory Document Parsing Using NLP and ChatGPT

Automated Regulatory Document Parsing Using NLP and ChatGPT

Utilized advanced natural language models, including ChatGPT, for the precise interpretation and extraction of key information from regulatory documents. This project involved the innovative application of NLP techniques, drastically reducing manual review hours and enhancing compliance efficiency in legal document understanding.

Learn more →
Social Media Job Extraction using NLP techniques backed by Sklearn, Deep Learning, and Spacy

Social Media Job Extraction using NLP techniques backed by Sklearn, Deep Learning, and Spacy

Developed and fine-tuned natural language processing models to autonomously identify and categorize job postings on social media platforms.

Learn more →
ML-Driven Crop Disease Prediction for Effective Pest Control

ML-Driven Crop Disease Prediction for Effective Pest Control

Developed a fleet of Machine Learning models to accurately predict crop disease outbreaks and provide data-driven pest control recommendations.

Learn more →
Big Data Analytics environment setup and analysis using Apache Hadoop, Spark, and Tableau

Big Data Analytics environment setup and analysis using Apache Hadoop, Spark, and Tableau

Implemented a comprehensive Big Data ETL and visualization pipeline using the Hadoop ecosystem, Spark, and Tableau for analyzing public StackExchange data.

Learn more →