{"id":66555,"date":"2022-06-16T17:40:55","date_gmt":"2022-06-16T08:40:55","guid":{"rendered":"https:\/\/avinton.com\/?page_id=66555"},"modified":"2023-04-25T15:33:36","modified_gmt":"2023-04-25T06:33:36","slug":"data-scientist","status":"publish","type":"page","link":"https:\/\/avinton.com\/en\/academy\/data-scientist\/","title":{"rendered":"Data Scientist"},"content":{"rendered":"<div class=\"wpb-content-wrapper\"><p>[vc_row type=&#8221;full-width-section&#8221; content_placement=&#8221;middle&#8221; css=&#8221;.vc_custom_1634099672517{margin-top: 0px !important;margin-bottom: 0px !important;border-top-width: 0px !important;border-bottom-width: 0px !important;padding-top: 90px !important;padding-bottom: 90px !important;background-image: url(https:\/\/avinton.com\/wp-content\/uploads\/2020\/07\/avinton-news-1.jpg?id=28403) !important;}&#8221;][vc_column][vc_custom_heading text=&#8221;Data Scientist&#8221; font_container=&#8221;tag:h1|font_size:48|text_align:center|color:%23ffffff|line_height:1.5&#8243; use_theme_fonts=&#8221;yes&#8221; css=&#8221;.vc_custom_1655368020758{margin-top: 0px !important;margin-bottom: 0px !important;border-top-width: 0px !important;border-bottom-width: 0px !important;padding-top: 0px !important;padding-bottom: 0px !important;}&#8221;][vc_custom_heading text=&#8221;Start Your Engineering Training Journey With Avinton Academy&#8221; font_container=&#8221;tag:h2|font_size:20|text_align:center|color:%23ffffff|line_height:1.5&#8243; use_theme_fonts=&#8221;yes&#8221; css=&#8221;.vc_custom_1655370563900{margin-top: 0px !important;margin-bottom: 0px !important;border-top-width: 0px !important;border-bottom-width: 0px !important;padding-top: 15px !important;padding-bottom: 0px !important;}&#8221;][\/vc_column][\/vc_row][vc_row type=&#8221;full-width-section&#8221; css=&#8221;.vc_custom_1655368235410{margin-top: 0px !important;margin-bottom: 0px !important;border-top-width: 0px !important;border-bottom-width: 0px !important;padding-top: 70px !important;}&#8221;][vc_column width=&#8221;1\/4&#8243;][vc_widget_sidebar sidebar_id=&#8221;thefox_mc_sidebar&#8221;][\/vc_column][vc_column width=&#8221;3\/4&#8243;][vc_column_text]<\/p>\n<h2 class=\"header2\">What does a data scientist do?<\/h2>\n<p dir=\"auto\" data-sourcepos=\"5:1-5:144\">A data scientist works with machine learning techniques like natural language processing and neural networks to build models that power AI\u2013based applications.<\/p>\n<h2 class=\"header2\">What are career opportunities for data scientists?<\/h2>\n<p>Text Text Text<\/p>\n<h2 class=\"header2\"><strong>Some examples of AI-based applications developed at Avinton<\/strong><\/h2>\n<p>[\/vc_column_text][vc_row_inner content_placement=&#8221;middle&#8221; css=&#8221;.vc_custom_1655371234892{margin-top: 0px !important;margin-bottom: 0px !important;border-top-width: 0px !important;border-bottom-width: 0px !important;padding-top: 35px !important;padding-bottom: 70px !important;}&#8221;][vc_column_inner width=&#8221;1\/2&#8243;][vc_single_image image=&#8221;66558&#8243; img_size=&#8221;408&#215;245&#8243;][\/vc_column_inner][vc_column_inner width=&#8221;1\/2&#8243;][vc_column_text]<\/p>\n<h3 dir=\"auto\" style=\"text-align: center;\" data-sourcepos=\"9:1-9:57\"><strong>Environmental Conservation in the Philippines using an AI Camera<\/strong><\/h3>\n<p dir=\"auto\" style=\"text-align: center;\" data-sourcepos=\"13:1-13:211\">Avinton in collaboration with Ericsson deployed an Artificial Intelligence (AI)-enabled CCTV cameras to identify migratory bird species and enable better preservation of endangered wildlife in Sasmuan, Pampanga.<\/p>\n<p>[\/vc_column_text][\/vc_column_inner][\/vc_row_inner][vc_row_inner content_placement=&#8221;middle&#8221; css=&#8221;.vc_custom_1655371237693{margin-top: 0px !important;margin-bottom: 0px !important;border-top-width: 0px !important;border-bottom-width: 0px !important;padding-top: 25px !important;padding-bottom: 70px !important;}&#8221;][vc_column_inner width=&#8221;1\/2&#8243;][vc_single_image image=&#8221;54778&#8243; img_size=&#8221;408&#215;245&#8243;][\/vc_column_inner][vc_column_inner width=&#8221;1\/2&#8243;][vc_column_text]<\/p>\n<h3 class=\"vc_custom_heading vc_custom_1619511157266\" style=\"text-align: center;\">Securing Worker Safety At Manufacturing Plants in Real-Time<\/h3>\n<div class=\"wpb_text_column wpb_content_element ct_kcjgxk4pnq5f09l8u65c\">\n<div class=\"wpb_wrapper\">\n<p style=\"text-align: center;\">Optical devices powered by pre-trained AI models have been deployed throughout a factory floor. These devices detect body parts of workers entering areas of risk \u2013 for example, hands getting too close to open rollers, heavy machinery, or robots. The benefits of Avinton\u2019s solution include the real-time monitoring of worker safety in cases where milliseconds matter.<\/p>\n<\/div>\n<\/div>\n<p>[\/vc_column_text]<style type=\"text\/css\" >#b_nr6lvsfr7e99zzqeefcq a{color:#ffffff; background:#b50000; margin-top:0px; margin-bottom:0px;  margin-left:0px; margin-right:0px; font-weight:700; border-radius:0px;}#b_nr6lvsfr7e99zzqeefcq a:hover{color:#ffffff ; background:#d30000 ;}<\/style><div class=\"ta_center tf_btn_pos\" id=\"b_nr6lvsfr7e99zzqeefcq\" ><a class=\"rd_normal_bt medium_rd_bt  \" href=\"https:\/\/avinton.com\/en\/services\/ai-service-development\/\" target=\"\"   >See more use cases<\/a><\/div>[\/vc_column_inner][\/vc_row_inner][\/vc_column][\/vc_row][vc_row type=&#8221;full-width-section&#8221; content_placement=&#8221;middle&#8221; css=&#8221;.vc_custom_1655370718685{margin-top: 0px !important;margin-bottom: 0px !important;border-top-width: 0px !important;border-bottom-width: 0px !important;padding-top: 25px !important;padding-bottom: 0px !important;}&#8221;][vc_column][vc_custom_heading text=&#8221;Start your journey to becoming a Data Scientist&#8221; font_container=&#8221;tag:h2|text_align:center|color:%23ffffff&#8221; use_theme_fonts=&#8221;yes&#8221; css=&#8221;.vc_custom_1655370697736{margin-right: 0px !important;margin-left: 0px !important;border-right-width: 0px !important;border-left-width: 0px !important;padding-top: 32px !important;padding-right: 32px !important;padding-bottom: 32px !important;padding-left: 32px !important;background-color: #0e3350 !important;border-radius: 15px !important;}&#8221;][vc_row_inner content_placement=&#8221;middle&#8221; css=&#8221;.vc_custom_1655370501418{margin-bottom: 0px !important;border-bottom-width: 0px !important;padding-bottom: 70px !important;}&#8221;][vc_column_inner width=&#8221;2\/3&#8243;][vc_column_text]<\/p>\n<h2 class=\"header2\"><span style=\"vertical-align: inherit;\">Linux<\/span><\/h2>\n<p><span style=\"vertical-align: inherit;\">As a starting point for a Machine Learning Engineer, we recommend at least <\/span><a href=\"https:\/\/linuc.org\/linuc1\/\" target=\"_blank\" rel=\"noopener noreferrer\"><span style=\"vertical-align: inherit;\">LPIC level 1<\/span><\/a><span style=\"vertical-align: inherit;\">.<\/span><\/p>\n<p><span style=\"vertical-align: inherit;\">This will ensure you don&#8217;t waste time trying to figure out what you should be typing on the command line to manipulate your environment.<\/span><\/p>\n<p><span style=\"vertical-align: inherit;\">For engineers experienced in linux or MacOS command line this may not be required<\/span>[\/vc_column_text][\/vc_column_inner][vc_column_inner width=&#8221;1\/3&#8243;][vc_single_image image=&#8221;13351&#8243; img_size=&#8221;medium&#8221; alignment=&#8221;center&#8221; css=&#8221;.vc_custom_1523333989220{margin-top: 1px !important;background-position: 0 0 !important;background-repeat: no-repeat !important;}&#8221;][\/vc_column_inner][\/vc_row_inner][\/vc_column][\/vc_row][vc_row type=&#8221;full-width-section&#8221; content_placement=&#8221;middle&#8221; css=&#8221;.vc_custom_1655368717964{margin-top: 0px !important;margin-bottom: 0px !important;border-top-width: 0px !important;border-bottom-width: 0px !important;padding-top: 35px !important;padding-bottom: 35px !important;}&#8221;][vc_column width=&#8221;2\/3&#8243;][vc_column_text]<\/p>\n<h2 class=\"header2\">Machine Learning Development Environment<\/h2>\n<p>The most commonly used environment for machine learning in Python is Jupyter Notebook &amp; Anaconda<\/p>\n<p>Please find some links below how to set these up<\/p>\n<p><a href=\"http:\/\/jupyter.org\/\" target=\"_blank\" rel=\"noopener noreferrer\">Jupyter Notebook<\/a><br \/>\n<a href=\"https:\/\/www.anaconda.com\/about-us\/\" target=\"_blank\" rel=\"noopener noreferrer\">Anaconda<\/a><br \/>\n<a href=\"https:\/\/avinton.com\/academy\/image-classification\/\" target=\"_blank\" rel=\"noopener noreferrer\">Jupyter &amp; Anaconda setup for Machine Learning in Python<\/a> (Avinton Academy)[\/vc_column_text][\/vc_column][vc_column width=&#8221;1\/3&#8243;][vc_single_image image=&#8221;13362&#8243; img_size=&#8221;medium&#8221; alignment=&#8221;center&#8221;][\/vc_column][\/vc_row][vc_row type=&#8221;full-width-section&#8221; content_placement=&#8221;middle&#8221; css=&#8221;.vc_custom_1655368722618{margin-top: 0px !important;margin-bottom: 0px !important;border-top-width: 0px !important;border-bottom-width: 0px !important;padding-top: 35px !important;padding-bottom: 35px !important;}&#8221;][vc_column width=&#8221;2\/3&#8243;][vc_column_text]<\/p>\n<h2 class=\"header2\">Machine Learning Basics<\/h2>\n<p>Python has some native machine learning capability and building up on your Python Programming Knowledge we recommend this <a href=\"https:\/\/www.amazon.co.jp\/%E3%82%BC%E3%83%AD%E3%81%8B%E3%82%89%E4%BD%9C%E3%82%8BDeep-Learning-%E2%80%95Python%E3%81%A7%E5%AD%A6%E3%81%B6%E3%83%87%E3%82%A3%E3%83%BC%E3%83%97%E3%83%A9%E3%83%BC%E3%83%8B%E3%83%B3%E3%82%B0%E3%81%AE%E7%90%86%E8%AB%96%E3%81%A8%E5%AE%9F%E8%A3%85-%E6%96%8E%E8%97%A4-%E5%BA%B7%E6%AF%85\/dp\/4873117585\/ref=sr_1_1?ie=UTF8&amp;qid=1523322361&amp;sr=8-1&amp;keywords=python+deep\" target=\"_blank\" rel=\"noopener noreferrer\">Python Deep Learning book<\/a> for a gentle transition from programming to machine learning<br \/>\nHere are some introductory tutorials about Machine Learning Basics:<\/p>\n<p><a href=\"https:\/\/avinton.com\/academy\/reinforcement-with-chainerrl\/\" target=\"_blank\" rel=\"noopener noreferrer\">ChainerRL<\/a> (Avinton Academy)<br \/>\n<a href=\"https:\/\/avinton.com\/academy\/random-forest\/\" target=\"_blank\" rel=\"noopener noreferrer\">Random Forest<\/a> (Avinton Academy)<br \/>\n<a href=\"https:\/\/avinton.com\/academy\/naive-bayes\/\" target=\"_blank\" rel=\"noopener noreferrer\">Naive Bayes<\/a> (Avinton Academy)<br \/>\n<a href=\"https:\/\/avinton.com\/academy\/classification-regression\/\" target=\"_blank\" rel=\"noopener noreferrer\">Classification vs Regression<\/a> (Avinton Academy)<br \/>\n<a href=\"https:\/\/avinton.com\/academy\/svm\/\" target=\"_blank\" rel=\"noopener noreferrer\">SVM<\/a> (Avinton Academy)[\/vc_column_text][\/vc_column][vc_column width=&#8221;1\/3&#8243;][vc_single_image image=&#8221;13366&#8243; img_size=&#8221;medium&#8221; alignment=&#8221;center&#8221;][\/vc_column][\/vc_row][vc_row type=&#8221;full-width-section&#8221; content_placement=&#8221;middle&#8221; css=&#8221;.vc_custom_1655368726648{margin-top: 0px !important;margin-bottom: 0px !important;border-top-width: 0px !important;border-bottom-width: 0px !important;padding-top: 35px !important;padding-bottom: 35px !important;}&#8221;][vc_column width=&#8221;2\/3&#8243;][vc_column_text]<\/p>\n<h2 class=\"header2\">Machine Learning Libraries and Frameworks<\/h2>\n<p>Depending on the application and the environment a machine learning project may use different environment, libraries, frameworks that are best suited for the problem being solved.<\/p>\n<p>The most common ones today are:<\/p>\n<p><a href=\"https:\/\/www.tensorflow.org\/\" target=\"_blank\" rel=\"noopener noreferrer\">TensorFlow<\/a><br \/>\n<a href=\"https:\/\/keras.io\/\" target=\"_blank\" rel=\"noopener noreferrer\">Keras<\/a><br \/>\n<a href=\"http:\/\/scikit-learn.org\/stable\/\" target=\"_blank\" rel=\"noopener noreferrer\">Scikit Learn<\/a><\/p>\n<p>You should try to familiarise yourself with these[\/vc_column_text][\/vc_column][vc_column width=&#8221;1\/3&#8243;][vc_single_image image=&#8221;13381&#8243; img_size=&#8221;medium&#8221; alignment=&#8221;center&#8221;][\/vc_column][\/vc_row][vc_row type=&#8221;full-width-section&#8221; content_placement=&#8221;middle&#8221; css=&#8221;.vc_custom_1655368730827{margin-top: 0px !important;margin-bottom: 0px !important;border-top-width: 0px !important;border-bottom-width: 0px !important;padding-top: 35px !important;padding-bottom: 35px !important;}&#8221;][vc_column width=&#8221;2\/3&#8243;][vc_column_text]<\/p>\n<h2 class=\"header2\">Data Analyses<\/h2>\n<p>Depending on the type of problem being solved you may be required to do some data processing and data analyses.<\/p>\n<h3>SQL<\/h3>\n<p>In many large organisations data is stored in large Relational Databases. We use SQL for querying and manipulating the data in the database. SQL is a very powerful language where we can perform some rather complex data analyses with only a few lines of simple code. You are encouraged to familiarise yourself with SQL using <a href=\"https:\/\/www.postgresql.org\/\" target=\"_blank\" rel=\"noopener noreferrer\">PostgreSQL database<\/a> as it is the most advanced Open Source Database.<\/p>\n<h3>Python &#8211; Numpy<\/h3>\n<p>During the data pre-processing for Machine Learning we often need to do some data extraction and manipulation from the original dataset. <a href=\"http:\/\/www.numpy.org\/\" target=\"_blank\" rel=\"noopener noreferrer\">Python Numpy<\/a> is a package that gives quite extensive and advanced mathatics functionality that can be used for all kinds of data modelling and analyses.<\/p>\n<p>You are encouraged to familiarise yourself with the numpy&#8217;s capability by browsing through the <a href=\"https:\/\/numpy.org\/doc\/stable\/reference\/\" target=\"_blank\" rel=\"noopener noreferrer\">NUMPY API reference<\/a><\/p>\n<h3>Pandas<\/h3>\n<p><a href=\"https:\/\/pandas.pydata.org\/\" target=\"_blank\" rel=\"noopener noreferrer\">Pandas<\/a> is a library for working with data structures in Python. It extends the capability of Numpy.<br \/>\n<a href=\"http:\/\/pandas.pydata.org\/pandas-docs\/stable\/api.html\" target=\"_blank\" rel=\"noopener noreferrer\">Pandas API Reference<\/a><\/p>\n<p>Have a look at the documentation and see which functions may be useful for your data manipulation requirements.[\/vc_column_text][\/vc_column][vc_column width=&#8221;1\/3&#8243;][vc_single_image image=&#8221;13379&#8243; img_size=&#8221;medium&#8221; alignment=&#8221;center&#8221;][\/vc_column][\/vc_row][vc_row type=&#8221;full-width-section&#8221; content_placement=&#8221;middle&#8221; css=&#8221;.vc_custom_1655368738147{margin-top: 0px !important;margin-bottom: 0px !important;border-top-width: 0px !important;border-bottom-width: 0px !important;padding-top: 35px !important;padding-bottom: 35px !important;}&#8221;][vc_column width=&#8221;2\/3&#8243;][vc_column_text]<\/p>\n<h2 class=\"header2\"><span style=\"vertical-align: inherit;\"><span style=\"vertical-align: inherit;\">OpenCV &#8211; Image Processing<\/span><\/span><\/h2>\n<p><span style=\"vertical-align: inherit;\"><span style=\"vertical-align: inherit;\">Many Machine Learning Engineers and projects in the industry today are using AI for object detection and other related image analyses.<\/span><\/span><\/p>\n<p><span style=\"vertical-align: inherit;\"><span style=\"vertical-align: inherit;\">For any of such projects we usually use OpenCV integrated within a machine learning library to label the images and do any pre-processing if required<\/span><\/span><\/p>\n<p><span style=\"vertical-align: inherit;\"><span style=\"vertical-align: inherit;\">You can find some OpenCV tutorials below:<\/span><\/span><\/p>\n<p><a href=\"https:\/\/avinton.com\/academy\/python2-7%E3%81%A8opencv%E3%81%AE%E3%82%A4%E3%83%B3%E3%82%B9%E3%83%88%E3%83%BC%E3%83%AB\/\" target=\"_blank\" rel=\"noopener noreferrer\">Python OpenCV setup<\/a> (Avinton Academy)<br \/>\n<a href=\"https:\/\/avinton.com\/academy\/opencv%E3%81%AE%E3%83%86%E3%82%B9%E3%83%88%E3%83%97%E3%83%AD%E3%82%B0%E3%83%A9%E3%83%A0\/\" target=\"_blank\" rel=\"noopener noreferrer\">OpenCV simple Exercise<\/a> (Avinton Academy)<br \/>\n<a href=\"https:\/\/avinton.com\/academy\/python%E3%81%AB%E3%82%88%E3%82%8B%E3%83%9E%E3%83%AB%E3%83%81%E3%82%B9%E3%83%AC%E3%83%83%E3%83%89%E3%83%97%E3%83%AD%E3%82%B0%E3%83%A9%E3%83%9F%E3%83%B3%E3%82%B0%E5%AE%9F%E8%B7%B5\/\" target=\"_blank\" rel=\"noopener noreferrer\">OpenCV Advanced Exercise<\/a> (Avinton Academy)[\/vc_column_text][\/vc_column][vc_column width=&#8221;1\/3&#8243;][vc_single_image image=&#8221;13372&#8243; img_size=&#8221;medium&#8221; alignment=&#8221;center&#8221;][\/vc_column][\/vc_row][vc_row type=&#8221;full-width-section&#8221; content_placement=&#8221;middle&#8221; css=&#8221;.vc_custom_1655368742883{margin-top: 0px !important;margin-bottom: 0px !important;border-top-width: 0px !important;border-bottom-width: 0px !important;padding-top: 35px !important;padding-bottom: 35px !important;}&#8221;][vc_column width=&#8221;2\/3&#8243;][vc_column_text]<\/p>\n<h2 class=\"header2\">Online Courses<\/h2>\n<p><a href=\"https:\/\/www.coursera.org\/learn\/machine-learning\" target=\"_blank\" rel=\"noopener noreferrer\">Stanford Machine Learning &#8211; Coursera &#8211; Andrew Ng<\/a><br \/>\nLecture Series is available on <a href=\"https:\/\/www.youtube.com\/playlist?list=PLLssT5z_DsK-h9vYZkQkYNWcItqhlRJLN\" target=\"_blank\" rel=\"noopener noreferrer\">YouTube<\/a><\/p>\n<p><a href=\"https:\/\/www.coursera.org\/specializations\/deep-learning\" target=\"_blank\" rel=\"noopener noreferrer\">Coursera Deep Learning<\/a><\/p>\n<p><a href=\"http:\/\/www.fast.ai\/\" target=\"_blank\" rel=\"noopener noreferrer\">FastAI<\/a> have a good set of courses which are free especially their <a href=\"https:\/\/github.com\/fastai\/numerical-linear-algebra\/blob\/master\/README.md\" target=\"_blank\" rel=\"noopener noreferrer\">Computational Linear Algebra course<\/a>.<\/p>\n<h3>Lecture Series<\/h3>\n<p><a href=\"https:\/\/sites.google.com\/view\/deep-rl-bootcamp\/lectures\" target=\"_blank\" rel=\"noopener noreferrer\">Deep RL Bootcamp<\/a><br \/>\n<a href=\"https:\/\/www.youtube.com\/playlist?list=PL3FW7Lu3i5JvHM8ljYj-zLfQRF3EO8sYv\" target=\"_blank\" rel=\"noopener noreferrer\">Andrej Karpathy\u2019s CNN course at Stanford<\/a> (CS231n: Convolutional Neural Networks for Visual Recognition)<br \/>\n<a href=\"http:\/\/rll.berkeley.edu\/deeprlcourse\/\" target=\"_blank\" rel=\"noopener noreferrer\">Sergey Levine\u2019s Deep Reinforcement Learning Course<\/a> &#8211; UC Berkeley (CS 294: Deep Reinforcement Learning)<br \/>\n<a href=\"http:\/\/www.learningmachines101.com\/2016\/\" target=\"_blank\" rel=\"noopener noreferrer\">Learning Machines 101<\/a> by Richard Golden[\/vc_column_text][\/vc_column][vc_column width=&#8221;1\/3&#8243;][vc_single_image image=&#8221;13394&#8243; img_size=&#8221;medium&#8221; alignment=&#8221;center&#8221;][\/vc_column][\/vc_row][vc_row type=&#8221;full-width-section&#8221; content_placement=&#8221;middle&#8221; css=&#8221;.vc_custom_1655368750232{margin-top: 0px !important;margin-bottom: 0px !important;border-top-width: 0px !important;border-bottom-width: 0px !important;padding-top: 35px !important;padding-bottom: 35px !important;}&#8221;][vc_column width=&#8221;2\/3&#8243;][vc_column_text]<\/p>\n<h2 class=\"header2\"><span style=\"vertical-align: inherit;\"><span style=\"vertical-align: inherit;\">Infrastructure Basics<\/span><\/span><\/h2>\n<p><span style=\"vertical-align: inherit;\"><span style=\"vertical-align: inherit;\">Machine Learning tasks are typically rather computationally intensive so it is important that a Machine Learning Engineer understands some infrastructure basics in order to take full advantage of the underlying server hardware in our environment to make our model training process efficient.<\/span><\/span><\/p>\n<h3><span style=\"vertical-align: inherit;\"><span style=\"vertical-align: inherit;\">Server Resources: CPU \/ Disk \/ RAM<\/span><\/span><\/h3>\n<p><span style=\"vertical-align: inherit;\"><span style=\"vertical-align: inherit;\">Avinton Academy instructor led training will give you the basics on this area. Please attend the next available session when you have time.<\/span><\/span><\/p>\n<h3><span style=\"vertical-align: inherit;\"><span style=\"vertical-align: inherit;\">CPU vs GPU<\/span><\/span><\/h3>\n<p><span style=\"vertical-align: inherit;\"><span style=\"vertical-align: inherit;\">In many cases we use a GPU to train the machine learning model more efficiently. It will be good to familiarise yourself with how a GPU works and why it is better at certain tasks than a CPU<\/span><\/span><\/p>\n<p><span style=\"vertical-align: inherit;\"><span style=\"vertical-align: inherit;\">This is covered in the Avinton Academy Infrastructure workshop day 2<\/span><\/span><\/p>\n<h3><span style=\"vertical-align: inherit;\"><span style=\"vertical-align: inherit;\">Virtualization Concepts<\/span><\/span><\/h3>\n<p><span style=\"vertical-align: inherit;\"><span style=\"vertical-align: inherit;\">In most cases your development environment is likely going to be on a virtual machine. As such it is good for you to familiarise yourself with Virtualisation concepts. The best approach for this would be to set up VMWare ESXI on a server from scratch and create the host Virtual Machines using the Hypervisor&#8217;s web gui.<\/span><\/span><\/p>\n<p><span style=\"vertical-align: inherit;\"><span style=\"vertical-align: inherit;\">This is covered in Avinton Academy Infrastructure Workshop.<\/span><\/span><\/p>\n<h3><span style=\"vertical-align: inherit;\"><span style=\"vertical-align: inherit;\">AWS EC2<\/span><\/span><\/h3>\n<p><span style=\"vertical-align: inherit;\"><span style=\"vertical-align: inherit;\">Many AI projects are deployed in the cloud. AWS (Amazon Web Services) is the leader in cloud services and as such we recommend you familiarise yourself with working with their EC2 platform (Elastic Compute).<\/span><\/span><\/p>\n<p><span style=\"vertical-align: inherit;\"><span style=\"vertical-align: inherit;\">You can refer to <\/span><\/span><a href=\"https:\/\/aws.amazon.com\/jp\/documentation\/?nc2=h_ql_d&amp;awsm=ql-5\" target=\"_blank\" rel=\"noopener noreferrer\"><span style=\"vertical-align: inherit;\"><span style=\"vertical-align: inherit;\">AWS&#8217;s official documentation<\/span><\/span><\/a><span style=\"vertical-align: inherit;\"><span style=\"vertical-align: inherit;\"> and on Avinton Academy we have a dedicated section guiding you through <\/span><\/span><a href=\"https:\/\/avinton.com\/academy\/\" target=\"_blank\" rel=\"noopener noreferrer\"><span style=\"vertical-align: inherit;\"><span style=\"vertical-align: inherit;\">how to work with AWS<\/span><\/span><\/a><\/p>\n<h3><span style=\"vertical-align: inherit;\"><span style=\"vertical-align: inherit;\">Docker Containers<\/span><\/span><\/h3>\n<p><span style=\"vertical-align: inherit;\"><span style=\"vertical-align: inherit;\">We often use Docker containers to easily replicate our environment from one system to another. As such I recommend you familiarise yourself with Docker Containers and try to Dockerise your environment for sharing with other team members.<\/span><\/span>[\/vc_column_text][\/vc_column][vc_column width=&#8221;1\/3&#8243;][vc_single_image image=&#8221;13373&#8243; img_size=&#8221;medium&#8221; alignment=&#8221;center&#8221;][\/vc_column][\/vc_row][vc_row type=&#8221;full-width-section&#8221; content_placement=&#8221;middle&#8221; css=&#8221;.vc_custom_1655368750232{margin-top: 0px !important;margin-bottom: 0px !important;border-top-width: 0px !important;border-bottom-width: 0px !important;padding-top: 35px !important;padding-bottom: 35px !important;}&#8221;][vc_column width=&#8221;2\/3&#8243;][vc_custom_heading text=&#8221;\u6a5f\u68b0\u5b66\u7fd2 \/ AI&#8221; use_theme_fonts=&#8221;yes&#8221; el_class=&#8221;header2&#8243; css=&#8221;.vc_custom_1655368514422{padding-bottom: 32px !important;}&#8221;][vc_column_text]<a href=\"https:\/\/avinton.com\/academy\/machine-learning-engineer-skills-required-ja\/\">\u6a5f\u68b0\u5b66\u7fd2\u30a8\u30f3\u30b8\u30cb\u30a2\u306b\u5fc5\u8981\u306a\u30b9\u30ad\u30eb<\/a><\/p>\n<p><a href=\"https:\/\/avinton.com\/academy\/image-classification\/\">\u6a5f\u68b0\u5b66\u7fd2\u3092\u7528\u3044\u305f\u753b\u50cf\u5206\u985e<\/a><\/p>\n<p><a href=\"https:\/\/avinton.com\/academy\/reinforcement-with-chainerrl\/\">\u6a5f\u68b0\u5b66\u7fd2\u5165\u9580\u8005\u5411\u3051 ChainerRL\u3067\u30d6\u30ed\u30c3\u30af\u5d29\u3057\u306e\u5b66\u7fd2<\/a><\/p>\n<p><a href=\"https:\/\/avinton.com\/academy\/random-forest\/\">\u6a5f\u68b0\u5b66\u7fd2\u5165\u9580\u8005\u5411\u3051 \u30e9\u30f3\u30c0\u30e0\u30d5\u30a9\u30ec\u30b9\u30c8\u306b\u3088\u308bTitanic\u751f\u5b58\u8005\u4e88\u6e2c<\/a><\/p>\n<p><a href=\"https:\/\/avinton.com\/academy\/naive-bayes\/\">\u6a5f\u68b0\u5b66\u7fd2\u5165\u9580\u8005\u5411\u3051 Naive Bayes(\u5358\u7d14\u30d9\u30a4\u30ba)\u30a2\u30eb\u30b4\u30ea\u30ba\u30e0\u306b\u89e6\u308c\u3066\u307f\u308b<\/a><\/p>\n<p><a href=\"https:\/\/avinton.com\/academy\/classification-regression\/\">\u6a5f\u68b0\u5b66\u7fd2\u5165\u9580\u8005\u5411\u3051 \u5206\u985e\u3068\u56de\u5e30\u306e\u9055\u3044\u3092\u30d7\u30ed\u30b0\u30e9\u30e0\u3092\u66f8\u3044\u3066\u5b66\u307c\u3046<\/a><\/p>\n<p><a href=\"https:\/\/avinton.com\/academy\/svm\/\">\u6a5f\u68b0\u5b66\u7fd2\u5165\u9580\u8005\u5411\u3051Support Vector Machine (SVM) \u306b\u89e6\u308c\u3066\u307f\u308b<\/a>[\/vc_column_text][\/vc_column][vc_column width=&#8221;1\/3&#8243;][vc_single_image image=&#8221;64375&#8243; img_size=&#8221;medium&#8221; alignment=&#8221;right&#8221;][\/vc_column][\/vc_row][vc_row][vc_column][vc_empty_space height=&#8221;70px&#8221;][\/vc_column][\/vc_row]<\/p>\n<\/div>","protected":false},"excerpt":{"rendered":"<p>[vc_row type=&#8221;full-width-section&#8221; content_placement=&#8221;middle&#8221; css=&#8221;.vc_custom_1634099672517{margin-top: 0px !important;margin-bottom: 0px !important;border-top-width: 0px !important;border-bottom-width: 0px !important;padding-top: 90px !important;padding-bottom: 90px !important;background-image: url(https:\/\/avinton.com\/wp-content\/uploads\/2020\/07\/avinton-news-1.jpg?id=28403) !important;}&#8221;][vc_column][vc_custom_heading text=&#8221;Data Scientist&#8221; font_container=&#8221;tag:h1|font_size:48|text_align:center|color:%23ffffff|line_height:1.5&#8243; use_theme_fonts=&#8221;yes&#8221; css=&#8221;.vc_custom_1655368020758{margin-top: 0px !important;margin-bottom: 0px !important;border-top-width: 0px !important;border-bottom-width: 0px !important;padding-top: 0px !important;padding-bottom: 0px !important;}&#8221;][vc_custom_heading text=&#8221;Start Your Engineering Training Journey With Avinton Academy&#8221; font_container=&#8221;tag:h2|font_size:20|text_align:center|color:%23ffffff|line_height:1.5&#8243; use_theme_fonts=&#8221;yes&#8221; css=&#8221;.vc_custom_1655370563900{margin-top: 0px !important;margin-bottom: 0px !important;border-top-width: 0px !important;border-bottom-width: 0px !important;padding-top: 15px<br \/><a href=\"https:\/\/avinton.com\/en\/academy\/data-scientist\/\" class=\"more\">Read more<\/a><\/p>\n","protected":false},"author":2,"featured_media":13374,"parent":1906,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"categories":[],"tags":[],"class_list":["post-66555","page","type-page","status-publish","has-post-thumbnail","hentry"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v25.8 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Data Scientist - Avinton Japan<\/title>\n<meta name=\"description\" content=\"What a machine learning engineer needs to know to be productive and effective in an AI project in today&#039;s active job market.\" \/>\n<meta name=\"robots\" content=\"noindex, nofollow\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Data Scientist - Avinton Japan\" \/>\n<meta property=\"og:description\" content=\"What a machine learning engineer needs to know to be productive and effective in an AI project in today&#039;s active job market.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/avinton.com\/en\/academy\/data-scientist\/\" \/>\n<meta property=\"og:site_name\" content=\"Avinton Japan\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/Avintons\/\" \/>\n<meta property=\"article:modified_time\" content=\"2023-04-25T06:33:36+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/avinton.com\/wp-content\/uploads\/2018\/04\/machine-learning-need-to-know.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"824\" \/>\n\t<meta property=\"og:image:height\" content=\"501\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:site\" content=\"@AvintonJapan\" \/>\n<meta name=\"twitter:label1\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data1\" content=\"6 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/avinton.com\/en\/academy\/data-scientist\/\",\"url\":\"https:\/\/avinton.com\/en\/academy\/data-scientist\/\",\"name\":\"Data Scientist - Avinton Japan\",\"isPartOf\":{\"@id\":\"https:\/\/avinton.com\/en\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/avinton.com\/en\/academy\/data-scientist\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/avinton.com\/en\/academy\/data-scientist\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/avinton.com\/wp-content\/uploads\/2018\/04\/machine-learning-need-to-know.jpg\",\"datePublished\":\"2022-06-16T08:40:55+00:00\",\"dateModified\":\"2023-04-25T06:33:36+00:00\",\"description\":\"What a machine learning engineer needs to know to be productive and effective in an AI project in today's active job market.\",\"breadcrumb\":{\"@id\":\"https:\/\/avinton.com\/en\/academy\/data-scientist\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/avinton.com\/en\/academy\/data-scientist\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/avinton.com\/en\/academy\/data-scientist\/#primaryimage\",\"url\":\"https:\/\/avinton.com\/wp-content\/uploads\/2018\/04\/machine-learning-need-to-know.jpg\",\"contentUrl\":\"https:\/\/avinton.com\/wp-content\/uploads\/2018\/04\/machine-learning-need-to-know.jpg\",\"width\":824,\"height\":501,\"caption\":\"Avinton machine learning need to know\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/avinton.com\/en\/academy\/data-scientist\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/avinton.com\/en\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Avinton Academy\",\"item\":\"https:\/\/avinton.com\/en\/academy\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"Data Scientist\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/avinton.com\/en\/#website\",\"url\":\"https:\/\/avinton.com\/en\/\",\"name\":\"Avinton Japan\",\"description\":\"Tailored Solutions and Consulting in AI and Big Data\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/avinton.com\/en\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Data Scientist - Avinton Japan","description":"What a machine learning engineer needs to know to be productive and effective in an AI project in today's active job market.","robots":{"index":"noindex","follow":"nofollow"},"og_locale":"en_US","og_type":"article","og_title":"Data Scientist - Avinton Japan","og_description":"What a machine learning engineer needs to know to be productive and effective in an AI project in today's active job market.","og_url":"https:\/\/avinton.com\/en\/academy\/data-scientist\/","og_site_name":"Avinton Japan","article_publisher":"https:\/\/www.facebook.com\/Avintons\/","article_modified_time":"2023-04-25T06:33:36+00:00","og_image":[{"width":824,"height":501,"url":"https:\/\/avinton.com\/wp-content\/uploads\/2018\/04\/machine-learning-need-to-know.jpg","type":"image\/jpeg"}],"twitter_card":"summary_large_image","twitter_site":"@AvintonJapan","twitter_misc":{"Est. reading time":"6 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/avinton.com\/en\/academy\/data-scientist\/","url":"https:\/\/avinton.com\/en\/academy\/data-scientist\/","name":"Data Scientist - Avinton Japan","isPartOf":{"@id":"https:\/\/avinton.com\/en\/#website"},"primaryImageOfPage":{"@id":"https:\/\/avinton.com\/en\/academy\/data-scientist\/#primaryimage"},"image":{"@id":"https:\/\/avinton.com\/en\/academy\/data-scientist\/#primaryimage"},"thumbnailUrl":"https:\/\/avinton.com\/wp-content\/uploads\/2018\/04\/machine-learning-need-to-know.jpg","datePublished":"2022-06-16T08:40:55+00:00","dateModified":"2023-04-25T06:33:36+00:00","description":"What a machine learning engineer needs to know to be productive and effective in an AI project in today's active job market.","breadcrumb":{"@id":"https:\/\/avinton.com\/en\/academy\/data-scientist\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/avinton.com\/en\/academy\/data-scientist\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/avinton.com\/en\/academy\/data-scientist\/#primaryimage","url":"https:\/\/avinton.com\/wp-content\/uploads\/2018\/04\/machine-learning-need-to-know.jpg","contentUrl":"https:\/\/avinton.com\/wp-content\/uploads\/2018\/04\/machine-learning-need-to-know.jpg","width":824,"height":501,"caption":"Avinton machine learning need to know"},{"@type":"BreadcrumbList","@id":"https:\/\/avinton.com\/en\/academy\/data-scientist\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/avinton.com\/en\/"},{"@type":"ListItem","position":2,"name":"Avinton Academy","item":"https:\/\/avinton.com\/en\/academy\/"},{"@type":"ListItem","position":3,"name":"Data Scientist"}]},{"@type":"WebSite","@id":"https:\/\/avinton.com\/en\/#website","url":"https:\/\/avinton.com\/en\/","name":"Avinton Japan","description":"Tailored Solutions and Consulting in AI and Big Data","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/avinton.com\/en\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"}]}},"_links":{"self":[{"href":"https:\/\/avinton.com\/en\/wp-json\/wp\/v2\/pages\/66555","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/avinton.com\/en\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/avinton.com\/en\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/avinton.com\/en\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/avinton.com\/en\/wp-json\/wp\/v2\/comments?post=66555"}],"version-history":[{"count":33,"href":"https:\/\/avinton.com\/en\/wp-json\/wp\/v2\/pages\/66555\/revisions"}],"predecessor-version":[{"id":80528,"href":"https:\/\/avinton.com\/en\/wp-json\/wp\/v2\/pages\/66555\/revisions\/80528"}],"up":[{"embeddable":true,"href":"https:\/\/avinton.com\/en\/wp-json\/wp\/v2\/pages\/1906"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/avinton.com\/en\/wp-json\/wp\/v2\/media\/13374"}],"wp:attachment":[{"href":"https:\/\/avinton.com\/en\/wp-json\/wp\/v2\/media?parent=66555"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/avinton.com\/en\/wp-json\/wp\/v2\/categories?post=66555"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/avinton.com\/en\/wp-json\/wp\/v2\/tags?post=66555"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}