ML Questions Examples

What is batch normalization? What are its drawbacks? Describe normalizations that rectify them.
Explain bias-variance trade-off.
List common generative models and their pros and cons.
Why is it difficult to train vanilla GANs? How can you improve the training.
How would you prevent a neural network from overfitting?
Explain how to apply drop-out. Does it differ for train and test?
Give examples of neural-network optimizers, how do they work? What is gradient explosion? How to rectify it with activation functions?
Why neural networks might not be suitable for a given task? (check out this article)
Explain attention mechanism in transformers (these are now a general framework that can be applied to a wide range of problems).
Describe a dimensionality reduction technique of your choice.

Computer Vision questions:

NLP questions:

Why do we need positional encoding in transformers?
Describe several attention mechanisms, what are advantages and disadvantages?
What are the advantages of LSTMs versus vanilla RNNs?
What would we expect from a good embedding?
How to evaluate translation quality? Why BLEU is not a very good metric?
What is contrastive predictive coding, and why is it helpful?
What techniques for NLP data augmentation do you know?
How to learn rare words when training models?
Explain how language can be viewed as a graph. What properties would the graph have?

System questions:

How would you tell which one of two models is better?
How would you design a video in-painting/face transfer/normal map reconstruction (this is from computer graphics)/spam detector algorithm from scratch?
How to deal with data-drift in the deployed model?
How to design an algorithm, that given an image of damaged fence would estimate the insurance cost?