# AWS Data Analyst
The main services are EMR, Redshift, Kinesis, and S3 + Glue + Athena. Together they account for approximately 80% of the exam.
Services to know for the exam
Glue & Glue Data Catalog
S3
DynamoDB
Kinesis Data Streams
Kinesis Data Firehose
Redshift and Spectrum
EMR and Hadoop Ecosystem
Lambda
Athena
QuickSight
Kinesis Data Analytics
Elasticsearch and Kibana
RDS & Aurora
Amazon MSK
IoT Core
Data Pipeline
DMS
Snowball and Direct Connect
SageMaker
Security considerations for all of the above services.
https://d0.awsstatic.com/whitepapers/Big_Data_Analytics_Options_on_AWS.pdf
https://d0.awsstatic.com/whitepapers/whitepaper-streaming-data-solutions-on-aws-with-amazon-kinesis.pdf
• Amazon EMR Migration Guide: How to Move Apache Spark and Apache Hadoop From On-Premises to AWS
• Big Data Options on AWS
• Lambda Architecture for Batch and Stream Processing
• Streaming Data Solutions on AWS with Amazon Kinesis
• Teaching Big Data Skills with Amazon EMR
• Reference Architecture: SQL Based Data Processing in Amazon ECS
External schema and table for Amazon Redshift Spectrum
https://www.aws.training/Details/eLearning?id=46612
https://www.aws.training/Details/eLearning?id=35364
Kinesis Best Practices
https://www.youtube.com/watch?v=MELPeni0p04&ab_channel=AmazonWebServices
https://aws.amazon.com/blogs/aws/new-amazon-kinesis-data-analytics-for-java/
https://docs.aws.amazon.com/streams/latest/dev/kinesis-using-sdk-java-after-resharding.html
https://aws.amazon.com/blogs/big-data/under-the-hood-scaling-your-kinesis-data-streams/
EMR Best Practices
https://www.youtube.com/watch?v=dU40df0Suoo&ab_channel=AWSOnlineTechTalks
https://docs.aws.amazon.com/emr/latest/ManagementGuide/emr-emrfs-iam-roles.html
https://docs.aws.amazon.com/emr/latest/ManagementGuide/emr-gs-process-sample-data.html
https://docs.aws.amazon.com/emr/latest/ManagementGuide/emr-block-public-access.html
Athena
https://aws.amazon.com/blogs/big-data/top-10-performance-tuning-tips-for-amazon-athena/
https://docs.aws.amazon.com/athena/latest/ug/other-notable-limitations.html
https://docs.aws.amazon.com/athena/latest/ug/user-created-workgroups.html
https://docs.aws.amazon.com/athena/latest/ug/compression-formats.html
https://aws.amazon.com/about-aws/whats-new/2019/02/athena_workgroups/
Elasticsearch
https://aws.amazon.com/premiumsupport/knowledge-center/high-jvm-memory-pressure-elasticsearch/
Glue
https://docs.aws.amazon.com/glue/latest/dg/monitor-data-warehouse-schedule.html
https://aws.amazon.com/blogs/big-data/optimizing-downstream-data-processing-with-amazon-kinesis-data-firehose-and-amazon-emr-running-apache-spark/
https://docs.aws.amazon.com/glue/latest/dg/aws-glue-programming-python-transforms.html
https://docs.aws.amazon.com/glue/latest/dg/built-in-transforms.html
Redshift
https://docs.aws.amazon.com/redshift/latest/dg/c_choosing_dist_sort.html
https://docs.aws.amazon.com/redshift/latest/dg/c_loading-data-best-practices.html
https://docs.aws.amazon.com/redshift/latest/dg/cm-c-implementing-workload-management.html
https://docs.aws.amazon.com/redshift/latest/dg/concurrency-scaling.html
https://aws.amazon.com/blogs/big-data/top-10-performance-tuning-techniques-for-amazon-redshift/
https://aws.amazon.com/blogs/big-data/amazon-redshift-engineerings-advanced-table-design-playbook-distribution-styles-and-distribution-keys/
https://docs.aws.amazon.com/redshift/latest/mgmt/security-key-management.html
https://docs.aws.amazon.com/redshift/latest/dg/c_workload_mngmt_classification.html
Redshift Spectrum
https://docs.aws.amazon.com/redshift/latest/dg/c-getting-started-using-spectrum.html
https://aws.amazon.com/blogs/big-data/10-best-practices-for-amazon-redshift-spectrum/
https://docs.aws.amazon.com/redshift/latest/mgmt/changing-cluster-encryption.html
QuickSight
https://docs.aws.amazon.com/quicksight/latest/user/restrict-access-to-a-data-set-using-row-level-security.html
https://docs.aws.amazon.com/dms/latest/userguide/CHAP_Troubleshooting.html
https://docs.aws.amazon.com/streams/latest/dev/developing-producers-with-sdk.html
https://docs.aws.amazon.com/glue/latest/dg/glue-troubleshooting-errors.html
https://docs.aws.amazon.com/streams/latest/dev/troubleshooting-producers.html
https://docs.aws.amazon.com/streams/latest/dev/troubleshooting-consumers.html