Data Science Technical Interview Questions (MCQ): Test Your Knowledge! What is the F1 score?The F1 score is a weighted average of precision and recallThe F1 score is the sum of precision and recallThe F1 score is the difference between precision and recallThe F1 score is the product of precision and recallWhat is the role of machine learning in chatbots?To enable the chatbot to learn and improve its responses over timeTo hardcode responses for the chatbotTo perform data visualizationTo generate static contentWhat is the primary goal of data normalization in databases?To make data abnormalTo organize data to reduce redundancyTo increase data complexityTo encrypt normal formsWhat is a common technique for feature scaling in machine learning?StandardizationNormalizationOne-hot encodingLabel encodingWhich algorithm is commonly used for anomaly detection in images?Linear regressionNaive BayesIsolation ForestDecision TreesWhat does GUI stand for in software design?General User IntegrationGraphical User InterfaceGuided Universal InteractionGenerated Utility ImplementationWhich of these is NOT a common method for natural language generation?Rule-basedStatisticalNeuralRandom generationWhat is the main purpose of data lineage?To create family trees for dataTo track the origin and transformations of dataTo increase data volumeTo encrypt data pathsWhat is spaCy and its diff. from NLTK? spaCy: NLP lib. Focuses on prod. use Efficient, supports deep learning All of the aboveWhat is the main purpose of generative adversarial networks?To generate adversariesTo learn data distributions and generate new samplesTo reduce network complexityTo encrypt generated dataWhat is the primary goal of LIME (Local Interpretable Model-agnostic Explanations)?To increase model accuracyTo explain individual predictionsTo speed up model trainingTo reduce model complexityWhat is the difference between precision and accuracy?Precision is consistency, accuracy is correctnessAccuracy is consistency, precision is correctnessThey are the sameNeither relates to consistencyWhich of these is an example of unsupervised learning?Linear regressionLogistic regressionPrincipal Component AnalysisRandom forestWhat is the main purpose of dropout?To drop out of trainingTo prevent overfittingTo reduce model sizeTo increase dropout rateWhich statement about machine learning is true?It cannot predict outcomesIt always needs labeled dataIt can learn from past dataIt does not require any dataWhat is a real-world application of clustering?Customer segmentation for targeted marketingPredicting stock pricesClassifying email as spam or not spamReducing data dimensionalityWhat is the difference between precision and recall?They are the samePrecision focuses on false positives, recall on false negativesRecall focuses on false positives, precision on false negativesBoth measure the same thingWhat is boosting in machine learning?A technique to reduce model complexityAn ensemble method that combines weak learnersA method to increase training data sizeA way to decrease model training timeWhat is the difference between machine learning and general programming? ML uses data to learn GP uses instructions ML adapts to new data GP follows fixed rulesWhat does MAE stand for in regression analysis?Mean Absolute ErrorMaximum Average ErrorMean Absolute EstimationMedian Average ErrorWhat is time series analysis?Time series analysis is a set of techniques for analyzing time series dataTime series analysis is a type of databaseTime series analysis is a tool for data visualizationTime series analysis is a type of machine learning algorithmWhich of these is NOT a common technique for handling imbalanced datasets?OversamplingUndersamplingSMOTEOverfittingWhich is not a type of data visualization chart?Bar chartLine chartPie chartQuantum chartWhat is the process of dividing a dataset into smaller, more manageable subsets for training and testing a model?Data splittingData cleaningData transformationData aggregationWhat does the Bias-Variance tradeoff address?The speed of the modelThe balance between model complexity and generalizationThe size of the training dataThe number of layers in a neural network Score: 0/25 Retake Quiz Next Set of Questions