Ultimate Engineering Study Guide - Questions & Answers

31. What's wrong with this model architecture: (6, 13, 1) a. the model has too many layers b. the model has too few layers C. the model should have the same or fewer nodes from one layer to the next d. nothing, looks ok 32. This method to prevent overfitting shrinks weights: a. dropout b. early stopping C. L1 or L2 regularization d. maxpooling 33. This method to prevent overfitting randomly sets weights to 0: a. dropout b. early stopping C. L1 or L2 regularization d. maxpooling 34. Which loss function would you choose for a multiclass classification problem? a. MSE b. MAE C. binary crossentropy d. categorical crossentropy 35. Select ALL that are true. Advantages of CNNs for image data include: a. CNN models are simpler than sequential models b. a pattern learned in one location will be recognized in other locations C. CNNs can learn hierarchical features in data d. none of the above 36. A convolution in CNN: a. happens with maxpooling. b. happens as a filter slides over data c. happens with pooling d. happens with the flatten operation 37. True or false. Maxpooling reduces the dimensions of the data. 38. True or false. LSTM suffers more from the vanishing gradient problem than an RNN 39. True or false. LSTM is simpler than GRU and trains faster. 40. True or false. Embeddings project count or index vectors to higher dimensional floating-point vectors. 41. True or false. The higher the embedding dimension, the less data required to learn the embeddings. 42. True or false. An n-dimensional embedding represents a word in n-dimensional space. 43. True or false. Embeddings are learned by a neural network focused on word context.
17. This metric measures the percentage of items that were classified as + that were truly + TP/(TP + FP) a. precision b. recall C. accuracy d. F-measure 18. This metric is a balance of precision and recall. a. p-value b. accuracy C. F-measure d. none of the above 19. True or false. It is helpful to use a development set to tune parameters if we have a small amount of data. 20. True or false. Nave Bayes is a discriminative model. 21. True or false. Kappa ranges from 0 to 1. 22. True or false. The ideal AUC value is either +1 or -1. 23. This term refers to how well an algorithm can model different data sets. a. bias b. variance c. none of the above 24. Select ALL that are true. The purpose of adding a regularization term to an objective function is: a. to prevent underfitting b. to prevent overfitting c. to penalize large weights d. to penalize small weights 25. Select ALL that are true. Which are true about activation functions for neural networks: a. the sigmoid function output ranges from 0 to 1 b. the tanh function output ranges from -1 to +1 C. the rely output ranges from 0 to infinity d. the softmax function output sums to 1 26. True or false. Neural networks can have only one output 27. True or false. Logistic regression requires more feature engineering than neural networks. Deep Learning Questions 28. Trueor false. A layer represents a function that inputs tensors and outputs transformed tensors. 29. True or false. A model defines how neuro are put gether. 30. Select ALL that are true. Advantages of deep learning models over more shallow neural networks and traditional ML algorithms: a. they can learn more complex functions b. they can learn data representations at the same time as the function c. they train faster d. they require less data