ML Questions Examples
- What is batch normalization? What are its drawbacks? Describe normalizations that rectify them.
- Explain bias-variance trade-off.
- List common generative models and their pros and cons.
- Why is it difficult to train vanilla GANs? How can you improve the training.
- How would you prevent a neural network from overfitting?
- Explain how to apply drop-out. Does it differ for train and test?
- Give examples of neural-network optimizers, how do they work? What is gradient explosion? How to rectify it with activation functions?
- Why neural networks might not be suitable for a given task? (check out this article)
- Explain attention mechanism in transformers (these are now a general framework that can be applied to a wide range of problems).
- Describe a dimensionality reduction technique of your choice.
Computer Vision questions:
- Why CNNs are so good when applied to images?
- How to apply deconvolution?
- Why transformers are challenging to apply to images?
- How would you map emotions from one face to another?
- What is temporal consistency and how to achieve it?
- What loss functions can you use for a generative model?
- Why FID score might is not the best realism metric?
- Describe convolution types and the motivation behind them.
- What is instance normalization?
- Describe loss functions for image reconstruction and their pros and cons.
NLP questions:
- Why do we need positional encoding in transformers?
- Describe several attention mechanisms, what are advantages and disadvantages?
- What are the advantages of LSTMs versus vanilla RNNs?
- What would we expect from a good embedding?
- How to evaluate translation quality? Why BLEU is not a very good metric?
- What is contrastive predictive coding, and why is it helpful?
- What techniques for NLP data augmentation do you know?
- How to learn rare words when training models?
- Explain how language can be viewed as a graph. What properties would the graph have?
System questions:
- How would you tell which one of two models is better?
- How would you design a video in-painting/face transfer/normal map reconstruction (this is from computer graphics)/spam detector algorithm from scratch?
- How to deal with data-drift in the deployed model?
- How to design an algorithm, that given an image of damaged fence would estimate the insurance cost?