# Agenda

1. Definition of problem statement and deliverables

   1.1 Background information on data flow architecture/API

   <mark>Show them the current use case 1/2/3 flow, the latest diagram, and the data-mining use case</mark>

   1.2 Scope of the research, if available

   <mark>Problem Statement:</mark>

   - <mark>No analytics or visualization of the existing TF-AMD data</mark>
   - <mark>Need to harness the hidden insights in the existing structured and unstructured data</mark>
   - <mark>No standardized data repository that gathers data from the numerous data sources to serve user-requested use cases</mark>

3. Will AMD provide any manpower?

   a) AMD staff doing part-time postgrad?

   <mark>Not for the time being. One engineer will be allocated to work with UTAR on Big Data.</mark>

5. After the project is over, will AMD handle its own maintenance?

   <mark>Yes</mark>

7. Any hardware support or software requirements? (SQL and Hadoop)

   - <mark>We prefer to handle and manage the hardware infrastructure (on-premises or cloud), but if the university has the infrastructure and software, we may be able to tap into that for the POC.</mark>
   - <mark>The Big Data software modules are open for proposal; we prefer all repositories in HDFS data lake format to facilitate the Big Data analytics process.</mark>

9. Scope

   1. <mark>Infrastructure and Big Data framework setup + HDFS data lake</mark>
   2. <mark>ETL of structured and unstructured data into the HDFS data lake</mark>
   3. <mark>Analytics based on use-case data via ETL</mark>
   4. <mark>Visualization of the data as needed</mark>

10. Others

    - <mark>NDA</mark>