Deep Learning and Deep Learning Models Used in Image Analysis
Abstract
In order to establish a machine learning system or model definition
with classical machine learning techniques, it is necessary to first extract
the feature vector. Experts are needed for extract the feature vector. For this
reason, these techniques are insufficient at the point where a raw data can be
processed. Deep learning has made tremendous progress by eliminating this
problem, which has been a challenge for many years in the field of machine
learning. Unlike traditional machine learning and image processing techniques,
Deep Learning do the learning process on raw data. It obtains the necessary
information from the representations that it formed in different layers. Deep
learning uses many areas such as image recognition, voice recognition, natural
language processing and gene analysis etc. Deep learning first attracted
attention with its success in the Large Scale Visual Recognition (ImageNet)
competition for object classification in 2012. In fact, the foundations of Deep
Learning depend on the past. But it has become popular in recent years mainly
due to two reasons. The first is the existence of as much data as training. The
second is the hardware infrastructure that will process this data. In this
study, information about deep learning was given and detailed information about
layers of convolution, pooling, ReLu and fully connected layers, which are
layers of Convolution Neural Network (CNN) architecture. It also describes
AlexNet, ZFNet, GoogLeNet, Microsoft RestNet and Region with Convolution Neural
Network (R-CNN) architectures, which can be considered as basic architects for
Deep Learning.
Keywords
References
- Amodei, D., S. Ananthanarayanan, R. Anubhai, J. Bai, E. Battenberg, C. Case, J. Casper, B. Catanzaro, Q. Cheng and G. Chen (2016). Deep speech 2: End-to-end speech recognition in english and mandarin. International Conference on Machine Learning.
- Assael, Y. M., B. Shillingford, S. Whiteson and N. de Freitas (2016). "LipNet: End-to-End Sentence-level Lipreading."
- Bahdanau, D., J. Chorowski, D. Serdyuk, P. Brakel and Y. Bengio (2016). End-to-end attention-based large vocabulary speech recognition. Acoustics, Speech and Signal Processing (ICASSP), 2016 IEEE International Conference on, IEEE.
- Bengio, Y., A. Courville and P. Vincent (2013). "Representation Learning: A Review and New Perspectives." Ieee Transactions on Pattern Analysis and Machine Intelligence 35(8): 1798-1828.
- Bengio, Y., P. Lamblin, D. Popovici and H. Larochelle (2007). "Greedy layer-wise training of deep networks." In NIPS’2006 . 14, 19, 200, 323, 324, 530, 532.
- Bengio, Y. and Y. LeCun (2007). "Scaling learning algorithms towards AI." In Large Scale Kernel Machines . 19.
- Bergstra, J. and Y. Bengio (2012). "Random search for hyper-parameter optimization." Journal of Machine Learning Research 13(Feb): 281-305.
- Cao, Z., T. Simon, S.-E. Wei and Y. Sheikh (2016). "Realtime multi-person 2d pose estimation using part affinity fields." arXiv preprint arXiv:1611.08050.
Details
Primary Language
Turkish
Subjects
Engineering
Journal Section
Review
Publication Date
December 29, 2017
Submission Date
July 24, 2017
Acceptance Date
December 8, 2017
Published in Issue
Year 2017 Volume: 6 Number: 3