- What are the three types of EMR nodes and their purposes?
- master node - orchestrates other nodes and holds logs
- core node - performs tasks and does write to HDFS (Hadoop File System)
- task node - performs tasks but does not write to HDFS (Hadoop File System)
- What is the difference between kinesis streams and kinesis firehose?
- Kinesis streams has shards that persist your data, while the data in firehose has to be dealt with on the spot.
- What is kinesis analytics?
- Kinesis analytics sits on top of either firehose or streams and analyzes the data in real time and can output it to a persistent storage.