Design and Implementation of a Model to Predict the Success of the Bank Telemarketing


Bachelor Thesis, 2015
28 Pages, Grade: A

Excerpt

Table of Contents

INTRODUCTION
1.1 ID3 Algorithm:
1.2 Objective:
1.3 Problem Statement:
1.4 Project Scope:
1.5 Report Organization:

LITERATURE REVIEW SYSTEM DESIGN
3.1 Introduction:
3.2 Proposed Model Design:
3.2.1 Logical Design (Flow Diagram):
3.2.2 Use Case Diagram:
3.2.3 Time Sequence Diagram
3.2.4 Structural diagram:

IMPLEMENTATION
4.1 System Development Requirements:
4.1.1 Software Requirements:
4.1.2 Hardware Requirements:
4.2 Data Set Description:
4.2.1 Attribute Description:
4.3 Decision Tree:
4.3.1 Decision Tree Algorithm:
4.3.2 Pseudo code:
4.3.3 Root node:
4.3.4 For further nodes (i.e. Internal and Leaf nodes):

RESULTS

CONCLUSION

FUTURE WORK

REFERENCES

ABSTRACT

Targeting customer is a major task of bank telemarketing to send their service to customers. Now banks are using a number of data mining techniques to predict the success rate. Decision Tree is successful data mining technique for predicting bank telemarketing success. Decision Tree is a well known classifier and is simple and easy to apply. Performance of decision trees can be improved with appropriate attribute selection. In this research, ID3 decision tree technique of data mining is applied on widely used benchmark data set. The main focus of this research was on designing and implementation of a model that predicts the success of bank telemarketing using decision tree technique of data mining.

Decision Tree is powerful techniques of DM which is used for the Success of Bank Telemarketing. The proposed system makes bank telemarketing more efficient. And the proposed system’s Database will be flexible to incorporate new changes at the same time keeping data secure. It is a need of bank telemarketing to gain the confidence of their profitable customers in order to retain their loyalty. Service providers need to closely track the trend in their industry so they can position themselves for a competitive edge in the future. Evolving technology, Rejection, Fake Calls is also a big challenge in telemarketing. The proposed system can be implemented in any bank telemarketing company to aid their decision making process and predict the customers who are more likely to involve with the new upcoming product in the market.

INTRODUCTION

Telemarketing is a marketing process in which bank or any other businesses can promote their services or product to the customers through telemarketing service, E-mail service. It is also called direct marketing because businesses are directly connected with their customers.

In banks, a large amount of data recorded about their customers. This recorded data can be used to keep direct relationship and create connection with the customers in order to target them individually for banking offers. The selected customer are directly connected through remote communication channels like cellular phone, mail, e-mail and any other source of communication. This is known as direct marketing. Direct marketing is a main source of many insurance companies and banks for connecting with their customers. Now many banks have adopted classification and prediction techniques of data mining to understand the customer behavior by predicting the customer data before offering services. By using data mining techniques, banks can send services or information to only those customers which are classified by the data.

Decision Tree popular is a power tool for prediction and classification. Decision Tree is admirable tools for helping out to select between some courses of action. Decision tree representsrules. Rules can be stated so one can understand them or can be directly used in a database language like SQL so that records falling in a particular group may be retrieved. Decision treeis a classifier that is in form of tree structure in which each node is aleaf node point to value of the target attribute or class) or adecision node(points to some test to be done on a single attribute with one branch and sub-tree for each possible outcome).

1.1 ID3 Algorithm:

The proposed system used ID3 algorithm. ID3 algorithms stand for Iterative Dichotomiser 3.It is presented by J. Ross Quinlan in 1986. ID3 is a no incremental algorithm, which means it derives its classes from a fixed set of training instances. It builds the shortest and fastest tree. In ID3, prediction rules are created from training set. It is very efficient algorithm in terms of processing time as it searches the whole dataset to create tree and can handle continuous attribute. It is used to calculate logistic calculations. In ID3 algorithm, one attribute is tested at a time for making a decision. ID3 is a supervised algorithm.

A statistical property called information gain that is used for attribute selection. The attribute with the highest information gain is selected from training data set. In order to define information gain we must calculate the entropy measures, the amount of information in an attribute.

ID3 uses dataset to generate decision tree in top-down fashion. The tree is constructed in ID3 in two phases. One is tree building and second is pruning. ID3 use gain approach to determine suitable property for each node generated in decision tree.ID3 starts from set of objects. One property is tested based on maximizing information gain and minimizing entropy at each node of the tree, and then results are used to split objects. This process is continuous recursively until objects belongs to same category. Then it becomes the leaf node of the decision tree

Formulas and procedure of ID3 algorithm is following:

- Select that attribute which has highest information gain
- Let pi be the probability that an arbitrary tuple in D belongs to class Ci, estimated by |C i, D|/|D|
- Expected information (entropy)

illustration not visible in this excerpt

- Information needed

illustration not visible in this excerpt

- Information gained through branching on attribute A

illustration not visible in this excerpt

1.2 Objective:

The main focus of this research was on designing and implementation of a model that predicts the success of bank telemarketing using decision tree technique of data mining. Decision Tree is powerful techniques of DM which is used for the Success of Bank Telemarketing. The proposed system makes bank telemarketing more efficient. And the proposed system’s Database will be flexible to incorporate new changes at the same time keeping data secure.

1.3 Problem Statement:

It is a need of bank telemarketing to gain the confidence of their profitable customers in order to retain their loyalty. Service providers need to closely track the trend in their industry so they can position themselves for a competitive edge in the future. Evolving technology, Rejection, Fake Calls is also a big challenge in telemarketing.

1.4 Project Scope:

The proposed system can be implemented in any bank telemarketing company to aid their decision making process and predict the customers who are more likely to involve with the new upcoming product in the market.

1.5 Report Organization:

The report is organized into four chapters.

Chapter 2 is of literature review, a literature survey for this research is explained there. Different research paper represents the importance of using ID3 in extracting rules using decision tree are discussed in this chapter. The dataset used in research is widely benchmark data set which is then used for predicting the success of bank telemarketing.

Chapter 3 describes the logical design of research project. Use case and UML diagrams are drawn to represent the flow of project. Design makes the work simpler and easy to understand. Flowchart represents whole activity of project in a manner that’s easy to understand. Structural Diagram represents whole view of system. Time sequence represents whole activity with respect to time event. State machine is used to represent change from one state to another.

Chapter 4 is about the implementation of project of this research. It represents the tools used for implementation. Software and Hardware required for the project are mentioned. Dataset used is described. Decision tree algorithm used in project is explained.

Chapter 5 describes the results.

Chapter 6 is conclusion of whole research and future work we can do. The best part of research element are explored along with the mode of implementation is presented in this chapter of thesis.

LITERATURE REVIEW

There are number of research work available for predicting the success of bank telemarketing using different techniques.

Feature Selection for Prediction of Bank Telemarketing was proposed by Chakarin Vajiramedhin et al [1]. Telemarketing is an impressive technique for marketing both for direct and the predictive. This technique is mainly used by the banks for long term deposits. This is done through phone calls which make the customers exasperated. A useful approach called data mining approach is used for predicting and solving problems such problems. This approach may fail when there is a huge amount of data having many features. This paper focuses on the ways how to minimize the input features and then apply the predictive techniques to produce more accurate results. Results are measured by comparing to expected results with the actual results. In this way the performance of the system can be analyzed.

This paper introduces the concept of using enhanced ID3 algorithm DID3 for decision making especially in bank sector. This concept was proposed by Ahmed Bahgat El Seddawy et al [2] which was rarely discussed before. Decision support system a system is such a system that contains and manages information. This information could be individual data or data from external data sources. Its purpose is to do a deep understanding of data and then use it for decision making. Success of any decision system can measure by the quality of data entered, understanding level, availability, timely presentation. This can be used as a repository for internal and external data. Tools, models and user interface give enough information required for decision making Strategy making, implementation, and critical decision-making and evaluation are its key factors. The system uses classification technique by DID3algorithm which allows the top level management for decision making in any uncertain environment.

Sergio Moro et al presented a Data-Driven Approach to Predict the Success of Bank Telemarketing [3]. Data of calculate form the year of 2009 to 2014 which shows the results of crises in terms of finance. Data contain about 150 features which includes product name, job type, campaign, client name etc. A semi-automatic technique for feature selection was applied on this dataset which minimize the dataset into 22 features. Comparison of various algorithms were carried out which includes neural networks, decision trees etc. Results were examined through evaluation phase. Neural Networks produced the excellent results that classify clients. Two more models were included in neural network which highlights some key attributes such as call direction, bank agent experience etc. Such models and results were considered very important for the mangers.

Data mining approach for bank telemarketing using Rimer package and R tool was proposed by Paulo Cortez et al [4] .Mostly in banks direct telemarketing campaigns are carried out in order to examine client’s interests. The paper examines the trends of the market and extracts a valuable knowledge. This was done through data mining techniques using Rimer and R tool. It uses ROC and Lift curve Analysis methods for analysis for which neural basiyan and vector machine algorithms was undertaken. Results show that Support vector machine produces the excellent results and a useful knowledge was extracted. This extracted knowledge then becomes the basis for future pattern analysis. Now the mangers and the telemarketing companies are putting more emphasize on predicting techniques for pattern evaluation instead of tradition analysis techniques.

Bank Direct Marketing Based on Neural Network was proposed by Hany. A. Elsalamony, et al [5] .Bank telemarketing is all about customers. Classification of such a huge amount of data is impossible to handle for any human. This give rises to the development of some interesting patterns and procedures for decision making process. Data mining is a great approach to solve such problem both in terms of performance and accuracy. This paper analyses different models for classification based on previous and new researches. A new algorithm called as Multilayer Perception Neural Network (MLPNN ) with combining features of C5.0. This research aims to measure the performance of this model on real world data. It analyzed how to increase the efficiency by highlighting those factors that have an effect on measuring success. Performance of the system was measures using these measures such as classification correctness, understanding, and specificity.

Rupali Bhardwaj et al [6] presented ID3 algorithm in 1975. ID3 is a supervised algorithm builds decision tree from examples of fixed sets, than the resulting tree is used to classify the future samples. This algorithm is used for producing good classification results. It takes a given set of attributes and then inductive methods are used. Appropriate results if ID3 leads them to be used in intelligent systems as well. This paper uses ID3 algorithm implementations training set data of approximately two weeks for making decisions. Java language was used for predicting results. Results were concluded inform of a decision tree and rules generation. Thos shows that ID3 works well for any dataset having discrete values.

SYSTEM DESIGN

3.1 Introduction:

Design of specific system gives outlook or appearance of that system. It shows the functionality of the system, its performance and it also show data storing and manipulation property in order to accomplish its task.

Design has to exhibit following qualities:

- Should be Easy to understand and simple one.
- Must explain working of system completely and clearly.
- It can cover user needs
- Can be mold it in to any shape and
- Should be Reliable

It also shows particular method, procedures or set of procedures that has been followed for developing any project, particular to the branch of knowledge.

3.2 Proposed Model Design:

The design of the research is based on flow diagram, use case diagrams, structural diagram, time sequence diagram.

3.2.1 Logical Design (Flow Diagram):

The logical Diagram of system includes the flow diagram or block diagrams which are graphical or symbolic representation of system work.

illustration not visible in this excerpt

Figure 1 Flow Diagram of Complete System

[...]

Excerpt out of 28 pages

Details

Title
Design and Implementation of a Model to Predict the Success of the Bank Telemarketing
Course
Final Year Project
Grade
A
Authors
Year
2015
Pages
28
Catalog Number
V310165
ISBN (eBook)
9783668091313
ISBN (Book)
9783668091320
File size
1885 KB
Language
English
Notes
The authors are not native speakers of English. Please excuse any linguistic mistakes.
Tags
design, implementation, model, predict, success, bank, telemarketing
Quote paper
Salma Bibi (Author)Adeela Batool (Author)Fatima Mustafa (Author), 2015, Design and Implementation of a Model to Predict the Success of the Bank Telemarketing, Munich, GRIN Verlag, https://www.grin.com/document/310165

Comments

  • No comments yet.
Read the ebook
Title: Design and Implementation of a Model to Predict the Success of the Bank Telemarketing


Upload papers

Your term paper / thesis:

- Publication as eBook and book
- High royalties for the sales
- Completely free - with ISBN
- It only takes five minutes
- Every paper finds readers

Publish now - it's free