Avinton Japan

Image Classification Task


Task Explanation

This task is designed to demonstrate how we can use programming to solve a real-world problem.

We will extract images from PDF files and classify them into three categories:

  • Text
  • Diagram
  • Image

Some sample PDF files can be downloaded here.

Examples of the image types are:

Text: [example image]

Diagram: [example image]

Image: [example image]

For this task we will use the OpenCV library.
The environment setup is described here and here.

Images in the Image class can be split further into their individual photos, for example:

[example image]

This is an individual task, but we will collaborate on it during the session.

You can ask questions at any time via email or Google Hangouts, and also during the training.

The expected output is as follows:

  • One directory per PDF file
  • Inside each directory, the images extracted from that PDF file
  • Each directory should also contain a text file listing every image name and its image type
  • Photos extracted from images of type Image should be prefixed with extracted, e.g. extracted_###.jpg

The text file should look like this:

img_1.jpg           Text
img_2.jpg           Text
img_3.jpg           Diagram
img_4.jpg           Diagram
img_5.jpg           Image
extracted_001.jpg   Image
extracted_002.jpg   Image
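
A minimal sketch of how such a listing could be produced is shown below. The names LABELS, format_listing and write_listing are hypothetical, and the column width of 19 characters is simply an assumption that fits the file names shown here. The task does not prescribe a name for the text file.

```python
LABELS = ("Text", "Diagram", "Image")


def format_listing(entries):
    """Turn (file_name, image_type) pairs into the aligned listing text."""
    lines = []
    for name, label in entries:
        if label not in LABELS:
            raise ValueError("unknown image type: %s" % label)
        # Left-align the name in a fixed-width column, then add the type.
        lines.append("{:<19} {}".format(name, label))
    return "\n".join(lines) + "\n"


def write_listing(entries, listing_path):
    """Write the listing to listing_path (the file name is up to you)."""
    with open(listing_path, "w") as handle:
        handle.write(format_listing(entries))
```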

 

Some hints:

To extract images from PDF files, look at the Linux command pdfimages.
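
The sketch below shows one way to call pdfimages from Python. It assumes pdfimages (part of poppler-utils) is installed and on the PATH. The function names and the "img" prefix are hypothetical. With the -j option, JPEG-encoded images are written as .jpg files and other images are written in PPM or PBM format, so some conversion may still be needed. pdfimages names its output <prefix>-000.jpg, <prefix>-001.jpg and so on, so renaming to img_1.jpg style is left to you.

```python
import os
import subprocess


def pdfimages_command(pdf_path, output_dir):
    """Build the pdfimages command line for one PDF (does not run it)."""
    # The last argument is a file name prefix, not a directory.
    prefix = os.path.join(output_dir, "img")
    return ["pdfimages", "-j", pdf_path, prefix]


def extract_images(pdf_path, output_dir):
    """Create output_dir if needed and run pdfimages on pdf_path."""
    if not os.path.isdir(output_dir):
        os.makedirs(output_dir)
    # check_call raises CalledProcessError if pdfimages exits with an error.
    subprocess.check_call(pdfimages_command(pdf_path, output_dir))
```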
Then we need to loop through the files in a directory, for example:
for directory, subdirectories, files in os.walk(source_folder):
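
Filled out into a complete function, the loop above might look like the sketch below. The names find_pdfs and output_dir_for are hypothetical, and naming each output directory after the PDF file without its extension is an assumption about how to get one directory per PDF.

```python
import os


def find_pdfs(source_folder):
    """Return the paths of all .pdf files under source_folder, sorted."""
    pdf_paths = []
    for directory, subdirectories, files in os.walk(source_folder):
        for name in files:
            # Match .pdf and .PDF alike.
            if name.lower().endswith(".pdf"):
                pdf_paths.append(os.path.join(directory, name))
    return sorted(pdf_paths)


def output_dir_for(pdf_path, output_root):
    """Directory for one PDF: output_root plus the file name without .pdf."""
    stem = os.path.splitext(os.path.basename(pdf_path))[0]
    return os.path.join(output_root, stem)
```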

Loops are common programming structures – Python For Loops

 

Useful References

Python Documentation

OpenCV Documentation

© 2023 Avinton | All Rights Reserved