A Web Navigation Framework to Identify the Influence of Faculty on Students Using Data Mining Techniques


Introduction
Present-day students flock to the internet as the primary tool for researching any topic. In most cases, students are influenced by different factors, and these influences drive them towards goal setting. This paper examines student behavior on the web and estimates the influence of faculty teaching on that behavior [1]. Sound procedures are needed to identify this inclination; thus, the faculty rating given by the students and the experience of the faculty in a particular field of specialization were taken into consideration. This work concentrates on an in-depth analysis of different kinds of students, specifically those in the engineering group [2,3].
Although students join an engineering college with the goal of receiving a degree, their future dreams differ [4,5]. Against this backdrop, we at GITAM University conducted a brainstorming session of about one week to motivate the students regarding the various opportunities that lie ahead of them. To understand the impact of this session on the students, we provided specific IP addresses and monitored their web navigation patterns.
A statistical framework is developed for clustering the students into different groups based on their navigation patterns. The objective of this prediction is to characterize the student behavior associated with a particular cluster. After classifying a student into one of the predefined groups, regression analysis is used to find the relationship between the student's browsing pattern and the influence of a teacher who is experienced or who has been rated good.

Related work
The experience of the faculty plays a vital role in impressing, motivating and educating students. Experienced and inexperienced faculty differ considerably in the way they explain a topic to students. In the brainstorming session conducted to guide students toward their future endeavors, we tested in practice how the experience of the faculty helps motivate students on their journey to success.
Kunyanuth et al. [6] proposed a model to guide students in choosing their track in the field of computer science. To select appropriate fields, student registration data, course data and class learning were analyzed using data mining techniques. That research aims at developing a decision support system that guides students in choosing the correct field according to their abilities and interests. The data used in the experiments was collected from the computer science program at Suan Sunandha Rajabhat University during the period 2006-2012. In the data gathering phase, four quizzes based on computer science fields were given to the students in the subjects of database, software engineering, multimedia, and network and communication. The equal-width method was used to partition the values of continuous attributes into five nominal values: VERY POOR, POOR, FAIR, GOOD and VERY GOOD. The data was analyzed using naive Bayesian and decision tree classification techniques, and the experimental results showed that naive Bayesian is more efficient than decision trees.
The present research helps in deciding whether a teacher with good experience affects students' behavior or whether a well-rated teacher impresses students more in the lecture. Nowadays rating has become a common measure of the quality of everything from people to goods; this research confirms that the quality of a teacher cannot be judged by rating but by experience.

Dataset
This data set contains the sessionised data for the gitam.edu web server (http://www.gitam.edu). The data is based on the students' navigation patterns over a period of one week during which the motivating session was delivered. The following snapshot presents a view of the dataset before preprocessing (Sample 1).
During preprocessing, the data is cleaned by removing whitespace and entries for image, audio and video files. The cleaned data is then processed to identify sessions, distinct users, unique URLs or page views, and the time spent in each page view. As a first step, all the unique URLs in the dataset are identified and assigned unique identification numbers. In the second step, the dataset is presented with the student ID together with his or her access sequence. Because the order of access is not a priority in the proposed work, a third step is carried out in which each entry in the dataset is recoded as 1 if the student visited the page and 0 otherwise. Tables 1-3 below show the outputs of the three preprocessing steps, respectively.

Abstract
Analyzing student web browsing behavior is a challenging task. This paper focuses on a methodology to identify the influencing factor that drives a student towards navigating a particular web site. Most research in this direction has not addressed estimating the influence of faculty on students' behavioral patterns. In this work we focus on a novel statistical approach based on an adaptive Gaussian mixture model, where the clustered data is given as input to the model to classify the student's navigation pattern. Regression analysis is used to find the relationship between a student's navigational behavior and the faculty's experience and rating. This article considers a real-time dataset of GITAM University for experimentation.

Clustering
As the dataset under consideration consists of different navigation patterns, clustering is used to mine the relevant browsing patterns. In our work we use this technique to identify groups of students with similar browsing patterns [7].

Latent class analysis
Latent class analysis (LCA) is considered for clustering the students into groups. This is a model-based cluster analysis technique that uses a mixture of probability distributions to assign a data point to a cluster.
The basic latent class cluster model is given by
$$f(y_n \mid \theta) = \sum_{j=1}^{S} \pi_j \, p_j(y_n \mid \theta_j)$$
where $y_n$ is the nth observation of the manifest variables, $S$ is the number of clusters and $\pi_j$ is the prior probability of membership in cluster $j$. $p_j$ is the cluster-specific probability of $y_n$ given the cluster-specific parameters $\theta_j$. For each data point, LCA calculates the probability of cluster membership. After the model is built, data points are assigned to the cluster with the highest probability.
After performing clustering, we identified six different groups among the 400 students and analyzed the URLs that each group had browsed.
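The recoding of access sequences into 0/1 entries and the latent class model described in this paper can be sketched together in a few lines. This is only an illustration under stated assumptions, not the authors' implementation: the manifest variables are assumed to be the binary page-visit indicators, each cluster is assumed to use independent Bernoulli probabilities per URL, and the function names, the toy sessions and the choice of two clusters are all hypothetical.

```python
import numpy as np

def visit_matrix(sessions, n_urls):
    """Recode access sequences as a 0/1 student-by-URL matrix (order is discarded)."""
    X = np.zeros((len(sessions), n_urls))
    for row, url_ids in enumerate(sessions):
        for u in url_ids:
            X[row, u] = 1.0
    return X

def fit_latent_classes(X, S, n_iter=100, seed=0):
    """EM for a latent class model with binary manifest variables (an assumption).

    Returns the priors pi (S,), the per-cluster visit probabilities theta
    (S, n_urls) and the posterior membership probabilities (n_students, S).
    """
    rng = np.random.default_rng(seed)
    n, d = X.shape
    pi = np.full(S, 1.0 / S)
    theta = rng.uniform(0.25, 0.75, size=(S, d))
    for _ in range(n_iter):
        # E-step: log of pi_j * p_j(y_n | theta_j), normalised over clusters.
        log_p = X @ np.log(theta).T + (1.0 - X) @ np.log(1.0 - theta).T + np.log(pi)
        log_p -= log_p.max(axis=1, keepdims=True)
        post = np.exp(log_p)
        post /= post.sum(axis=1, keepdims=True)
        # M-step: update priors and visit probabilities, kept away from 0 and 1.
        weight = post.sum(axis=0) + 1e-12
        pi = weight / weight.sum()
        theta = np.clip((post.T @ X) / weight[:, None], 1e-6, 1.0 - 1e-6)
    return pi, theta, post

# Hypothetical toy data: six students, four URL ids (0..3).
sessions = [[0, 1, 1], [0, 1], [1, 0, 0], [2, 3], [3, 2, 2], [2, 3, 3]]
X = visit_matrix(sessions, n_urls=4)
pi, theta, post = fit_latent_classes(X, S=2)
labels = post.argmax(axis=1)  # assign each student to the most probable cluster
```

Each student ends up in the cluster with the highest posterior probability, which mirrors the assignment rule stated for LCA. The number of clusters is supplied by hand here; the paper reports six groups but does not say how that number was selected.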

Classification
The objective of classification in this context is to assign a student to the cluster that best describes his or her behavior relative to similar students.

Adaptive Gaussian mixture model
The adaptive Gaussian mixture model (AGMM) is an improved version of the GMM in which the probability density is a function of the input vector x, the mean μ and the standard deviation σ, as in the GMM [8], with two additional parameters n and N, where N is the total number of samples in the data and n is the number of samples in each cluster.
The probability density function of the adaptive GMM is given by:
After classification, a new student with a given access sequence is assigned to one of the clusters identified above.
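As a rough illustration of the classification step, the sketch below assigns a new student to a cluster using an ordinary Gaussian mixture with diagonal covariance. It is not the AGMM density, which is not reproduced in this text. The use of n/N as the mixing weight of each cluster is an assumption made here, and the function names, the variance floor and the toy data are hypothetical.

```python
import numpy as np

def fit_clusters(X, labels):
    """Per-cluster mean, standard deviation and weight n / N from clustered data."""
    params = {}
    N = len(X)
    for k in np.unique(labels):
        members = X[labels == k]
        params[int(k)] = {
            "mu": members.mean(axis=0),
            "sigma": members.std(axis=0) + 1e-2,  # small floor avoids zero variance
            "weight": len(members) / N,           # n / N, assumed mixing weight
        }
    return params

def classify(x, params):
    """Return the cluster with the largest weighted Gaussian log-density for x."""
    def score(p):
        z = (x - p["mu"]) / p["sigma"]
        log_pdf = (-0.5 * np.sum(z ** 2)
                   - np.sum(np.log(p["sigma"]))
                   - 0.5 * len(x) * np.log(2.0 * np.pi))
        return np.log(p["weight"]) + log_pdf
    return max(params, key=lambda k: score(params[k]))

# Hypothetical 0/1 visit vectors for six already-clustered students.
X = np.array([[1, 1, 0, 0], [1, 1, 0, 0], [1, 0, 0, 0],
              [0, 0, 1, 1], [0, 0, 1, 1], [0, 1, 1, 1]], dtype=float)
labels = np.array([0, 0, 0, 1, 1, 1])
params = fit_clusters(X, labels)

new_student = np.array([1.0, 1.0, 0.0, 0.0])
cluster = classify(new_student, params)
```

Working with log-densities avoids numerical underflow when many URLs are involved. The cluster labels given as input play the role of the clustered data that the paper feeds to the model.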

Regression
Regression is a data mining (machine learning) technique used to fit an equation to a dataset. It is a statistical model for estimating relationships among variables. Regression analysis with a single explanatory variable is termed simple regression or linear regression. Linear regression uses the formula of a straight line (y = mx + b) and determines the appropriate values of m and b to predict the value of y for a given value of x. Multiple regression allows the use of more than one input variable and allows more complex models to be fitted, such as a quadratic equation [4].
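A minimal sketch of both forms of regression follows, using ordinary least squares from NumPy. The variable names and the numbers are made up for illustration (they are not the GITAM data), and treating faculty experience and rating as the two input variables is an assumption about how the inputs would be arranged.

```python
import numpy as np

def fit_line(x, y):
    """Least-squares slope m and intercept b for y = m*x + b."""
    A = np.column_stack([x, np.ones(len(x))])
    (m, b), *_ = np.linalg.lstsq(A, y, rcond=None)
    return m, b

def fit_multiple(X, y):
    """Least-squares weights and intercept for y = X @ w + b."""
    A = np.column_stack([X, np.ones(len(X))])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    return coef[:-1], coef[-1]

# Simple regression on hypothetical data generated from y = 2x + 1.
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = 2.0 * x + 1.0
m, b = fit_line(x, y)

# Multiple regression with two hypothetical inputs: experience and rating.
experience = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
rating = np.array([3.0, 1.0, 4.0, 1.0, 5.0])
X = np.column_stack([experience, rating])
y2 = 3.0 * experience + 0.5 * rating + 2.0
w, b2 = fit_multiple(X, y2)
```

Comparing the fitted weights for the two inputs is one way to judge which has the stronger association with the response, provided the inputs are measured on comparable scales.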

Conclusion
This paper presents a process for identifying different student clusters in a pool of students who are concentrating on different goals for their future, dynamically classifying a new student into one of the predefined clusters based on his or her behavioral pattern, and then using regression to estimate the effect of faculty experience and rating on students' browsing behavior. Using regression analysis, we concluded that faculty experience has a greater impact on student behavior than faculty rating.