top of page

Create Your First Project

Start adding your projects to your portfolio. Click on "Manage Projects" to get started

Big Data Processing for AgriTech Application

Project Type

Image feature extraction and classification

Tools

Python, PySpark, TensorFlow/Keras, PCA, t-SNE, AWS (EMR, EC2, S3)

Skills

Big Data processing, PySpark development, TensorFlow model broadcasting, Dimensionality reduction techniques, Cloud computing (AWS), Data visualization

Developed scalable data processing solutions for a mobile app aimed at identifying fruits using photos.

Designed and implemented PySpark scripts to efficiently process and prepare data for large-scale deployment in a cloud environment. Integrated a Convolutional Neural Network (CNN) using TensorFlow Keras for feature extraction and implemented model weight broadcasting to optimize distributed inference across multiple nodes.

Incorporated PCA for dimensionality reduction and t-SNE for visualization, assessing the effectiveness of feature extraction and its impact on classification.

The entire system was set up on AWS, utilizing EMR for big data processing, EC2 for computation, and S3 for data storage, ensuring compliance with GDPR.

bottom of page