Home
46.
What is the difference between Supervised Learning anUnsupervised Learning?

Unsupervised Learning?If an algorithm learns something from the training data so that the knowledgecan be applied to the test data, then it is referred to as Supervised Learning.Classification is an example for Supervised Learning. If the algorithm does notlearn anything beforehand because there is no response variable or any trainingdata, then it is referred to as unsupervised learning. Clustering is an example forunsupervised learning.

47.
What is the goal of A/B Testing?

It is a statistical hypothesis testing for randomized experiment with two variablesA and B. The goal of A/B Testing is to identify any changes to the web page tomaximize or increase the outcome of an interest. An example for this could beidentifying the click through rate for a banner ad.

48.
What is an Eigenvalue and Eigenvector?

Eigenvectors are used for understanding linear transformations. In data analysis,we usually calculate the eigenvectors for a correlation or covariance matrix.Eigenvectors are the directions along which a particular linear transformationacts by flipping, compressing or stretching. Eigenvalue can be referred to as thestrength of the transformation in the direction of eigenvector or the factor bywhich the compression occurs.

49.
How can outlier values be treated?
Outlier values can be identified by using univariate or any other graphicalanalysis method. If the number of outlier values is few then they can be assessedindividually but for large number of outliers the values can be substituted witheither the 99th or the 1st percentile values. All extreme values are not outliervalues.The most common ways to treat outlier values –
1) To change the value and bring in within a range
2) To just remove the value.

50.
How can you assess a good logistic model?

There are various methods to assess the results of a logistic regression analysis-

  • Using Classification Matrix to look at the true negatives and falsepositives.
  • Concordance that helps identify the ability of the logistic model todifferentiate between the event happening and not happening.
  • Lift helps assess the logistic model by comparing it with randomselection.