INTERNSHIP DETAILS
Computer Vision Intern
Company
FieldAssist
Location
Gurugram
Work Mode
On Site
Posted
February 4, 2026

Internship Information
Core Responsibilities
The intern will organize, clean, and preprocess large-scale retail image datasets and validate annotations. They will also support model training and evaluation, contributing to the production-ready pipeline.
Internship Type
intern
Company Size
454
Visa Sponsorship
No
Language
English
Working Hours
40 hours
About The Company
FieldAssist is the leading sales automation platform tailored specifically for FMCG and CPG brands. With our cutting-edge technology, we deliver real-time data insights, streamline field operations, and optimize Route-to-Market strategies, empowering you to achieve exceptional sales execution.
Our SFA solution empowers you with real-time sales tracking, effective order management, and exceptional on-field execution. Our DMS delivers seamless distributor operations, provides accurate inventory tracking to optimize your supply chain, and integrates flawlessly with ERP systems.
We don’t just automate; we elevate your performance with AI-driven solutions like Sales Co-Pilot, which offers smart recommendations and guided selling to sharpen your decision-making. Our Route Optimization feature ensures your field teams minimize travel time and maximize efficiency, while Perfect Store Execution guarantees compliance and showcases excellence in-store. Moreover, our Image Recognition (IR) technology enhances retail execution through real-time stock audits and performance tracking.
With our scalable solutions, deep insights, and seamless integrations, FieldAssist empowers your brand to sell smarter, execute faster, and achieve remarkable growth.
We at FieldAssist are dedicated to delivering powerful insights, an effortless mobile experience, and real-time analytics that empower brands to optimize their sales performance. Our scalable solutions and seamless integrations accelerate growth for FMCG and CPG brands, ensuring they achieve their fullest potential.
We proudly serve clients like Coca-Cola, Beiersdorf, Mars, Philips, Vivo and many more across more than 15 countries! Our dedicated approach achieves an impressive 90% market discipline by minimizing dormant outlets and effectively addressing duplicate and fake outlets. We’re excited to continue making a positive impact together!
About the Role
Computer Vision Intern
We are looking for a Computer Vision Intern to assist in building and refining our image recognition pipeline. The role will start with dataset management: image collection, annotation validation, dataset cleaning, and preprocessing. Once the foundational data work is complete, you’ll get hands-on exposure to model training, augmentation, and evaluation, contributing directly to our production-ready pipeline.
For one in the Seat:
Responsibilities
1. Organize, clean, and preprocess large-scale retail image datasets.
2. Validate and manage annotations (bounding boxes, class labels, segmentation masks if applicable) using tools like Roboflow, CVAT, or LabelImg.
3. Apply augmentation techniques and prepare datasets for training.
4. Support the training of YOLOv5/YOLOv8-based models on custom datasets.
5. Run model evaluations (Precision, Recall, F1 Score, SKU-level accuracy).
6. Collaborate with the product team to improve real-world inference quality.
7. Document the dataset pipeline and share insights for improving data quality.
Who we're looking for:
Must Have:
1. Basic understanding of Computer Vision concepts (Object Detection, Classification)
2. Familiarity with Python (OpenCV, Pandas, NumPy)
3. Knowledge of image annotation tools (Roboflow, LabelImg, CVAT, etc.)
4. Ability to manage and organise large datasets
Good to have:
1. Experience with YOLOv5 or YOLOv8 (Training, Inference, Fine-tuning)
2. Exposure to image augmentation techniques (Albumentations, etc.)
3. Understanding of retail/commercial shelf datasets or product detection problems
4. Previous internship or project experience in computer vision is a plus
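For applicants unfamiliar with the evaluation metrics named in the responsibilities (Precision, Recall, F1 Score), the following is a minimal Python sketch of how they are commonly computed from detection counts. The function name `detection_metrics` and the example counts are hypothetical illustrations, not part of FieldAssist's pipeline; how true and false positives are counted (for example, the IoU threshold for matching boxes) is an assumption left outside this sketch.

```python
def detection_metrics(tp: int, fp: int, fn: int) -> dict:
    """Compute precision, recall, and F1 from detection counts.

    tp: predictions matched to a ground-truth object (true positives)
    fp: predictions with no matching ground-truth object (false positives)
    fn: ground-truth objects the model missed (false negatives)
    """
    # Guard against division by zero when there are no predictions or no objects.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    # F1 is the harmonic mean of precision and recall.
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"precision": precision, "recall": recall, "f1": f1}


# Hypothetical counts for a single product class on a shelf-image test set.
metrics = detection_metrics(tp=80, fp=20, fn=20)
print(metrics)
```

Computing the same counts per product class would give a per-class view; the posting's "SKU-level accuracy" may be defined differently in practice.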
Key Skills
Computer Vision, Python, OpenCV, Pandas, NumPy, Image Annotation, Dataset Management, YOLOv5, YOLOv8, Image Augmentation, Retail Datasets, Model Evaluation, Collaboration, Documentation
Categories
Technology, Data & Analytics, Engineering, Retail, Software