Latest Data Science and data analytics Interview questions and answers for free

General Knowledge
General Aptitude
General English
Networking
Interview Questions
- .NET
- Java
- ASP.NET
- C++
- Perl
- Python
- Ruby and Rails
- Struts
- Core Java
- Hibernate
- DB2
- MS SQL Server
- MySQL
- Oracle
- SQL
- DBMS
- Data Warehousing
- Data structures and Algorithms
- CCNA
- CCNP Routing
- CCNP Switching
- Internetworking
- Border Gateway Protocol
- MCSE
- Exchange Server
- Windows Server 2008
- DNS & Active Directory
- Firewall Questions
- Unix
- Linux Server Administrator
- Linux System Administrator
- Linux File Manipulation
Database

Home
Interview Questions
Data Science

41.

What is the difference between Cluster and Systematic Sampling?

Cluster sampling is a technique used when it becomes difficult to study the targetpopulation spread across a wide area and simple random sampling cannot beapplied. Cluster Sample is a probability sample where each sampling unit is acollection, or cluster of elements. Systematic sampling is a statistical techniquewhere elements are selected from an ordered sampling frame. In systematicsampling, the list is progressed in a circular manner so once you reach the end of the list,it is progressed from the top again. The best example for systematic sampling is equal probability method.

42.

Are expected value and mean value different?

They are not different but the terms are used in different contexts. Mean isgenerally referred when talking about a probability distribution or samplepopulation whereas expected value is generally referred in a random variablecontext.

For Sampling Data
Mean value is the only value that comes from the sampling data.Expected Value is the mean of all the means i.e. the value that is built frommultiple samples. Expected value is the population mean.For Distributions
Mean value and Expected value are same irrespective of the distribution, underthe condition that the distribution is in the same population.

43.

What does P-value signify about the statistical data?

P-value is used to determine the significance of results after a hypothesis test instatistics. P-value helps the readers to draw conclusions and is always between 0and 1.

P- Value > 0.05 denotes weak evidence against the null hypothesis whichmeans the null hypothesis cannot be rejected.
P-value <= 0.05 denotes strong evidence against the null hypothesiswhich means the null hypothesis can be rejected.
P-value=0.05is the marginal value indicating it is possible to go either way.

44.

Do gradient descent methods always converge to same point?

No, they do not because in some cases it reaches a local minima or a localoptima point. You don’t reach the global optima point. It depends on the data andstarting conditions

45.

A test has a true positive rate of 100% and false positive rate of 5%.There is a population with a 1/1000 rate of having the condition the testidentifies. Considering a positive test, what is the probability of having thatcondition?

Let’s suppose you are being tested for a disease, if you have the illness the test will end up saying you have the illness. However, if you don’t have the illness-5% of the times the test will end up saying you have the illness and 95% of thetimes the test will give accurate result that you don’t have the illness. Thus thereis a 5% error in case you do not have the illness.

Out of 1000 people, 1 person who has the disease will get true positive result.
Out of the remaining 999 people, 5% will also get true positive result.
Close to 50 people will get a true positive result for the disease.

This means that out of 1000 people, 51 people will be tested positive for thedisease even though only one person has the illness. There is only a 2%probability of you having the disease even if your reports say that you have thedisease.

Pages
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20

Sitemap