In this project, I created and augmented a dataset from a number of given images to train and test convolutional neural network which is used to classify five classes of images of scanned documents. In order to generate the dataset, some image processing techniques were applied such as sliding-window, rotating, flipping and pyramid-sizing. The result of this phase is a set of images having same size 244x224x3. These images after being labeled were divided into three dataset for training, validating and testing the network.
The network is a simple convolution neural network which is also called LeNet. It has three convolutional layers and one fully connected layer. After being trained and validated, the best state of the network was pointed out and tested on the testing dataset and some real images. The result showed that the LeNet was able to classify images of documents in a pretty high accuracy. At the end of the project, I modified the network and discussed the affect that those changes had on the network with the purpose of creating another similar network which can perform better than the original one. The result proved that it worked a little better than its original version.
Contents
1 Introduction
1.1 Context
1.1.1 About ICTLab
1.1.2 ARCHIVES project
1.1.3 Internship context
1.2 Report organization
2 State of the art
2.1 Artificial intelligence & machine learning
2.2 Artificial neural network (ANN)
2.2.1 History
2.2.2 Regular neural network
2.2.3 Convolutional neural network (LeNet)
2.2.4 Training and evaluating
3 Contribution
3.1 Data creation and augmentation
3.1.1 ARCHIVES dataset
3.1.2 Creating data
3.1.3 Augmenting the data
3.1.4 Summary and Result
3.2 Constructing the convolution neural network (LeNet) .
3.2.1 The model
3.2.2 Preparing data
3.2.3 Training
3.2.4 Validation and testing
3.3 Developing the network
4 Results
4.1 The basic network
4.1.1 Testing on the dataset
4.1.2 Testing on real images
4.2 The network modifications
4.2.1 Fully connected layer
4.2.2 Convolutional layers
4.3 The new network
5 Conclusion
A Transfer functions
- Quote paper
- Tai Doan (Author), 2016, Convolutional Neural Network in classifying scanned documents, Munich, GRIN Verlag, https://www.grin.com/document/349852
-
Upload your own papers! Earn money and win an iPhone X. -
Upload your own papers! Earn money and win an iPhone X. -
Upload your own papers! Earn money and win an iPhone X. -
Upload your own papers! Earn money and win an iPhone X. -
Upload your own papers! Earn money and win an iPhone X. -
Upload your own papers! Earn money and win an iPhone X. -
Upload your own papers! Earn money and win an iPhone X. -
Upload your own papers! Earn money and win an iPhone X. -
Upload your own papers! Earn money and win an iPhone X. -
Upload your own papers! Earn money and win an iPhone X.