Apr 24, 2024  
2019-2020 Graduate Catalog [ARCHIVED CATALOG]


MDATA 510 - Data Engineering and Data Munging


Credits: (3)
This course focuses on the data acquisition, cleaning, manipulation, transformation, and analysis portions of the data science lifecycle. Data engineering and data munging techniques for a variety of data formats are covered. Open data repositories from government organizations, as well as social media sites and databases, are explored. The course continues to use the R computing environment and also investigates numerous ancillary tools from the R community for data transformation and visualization. Concepts discussed at a higher level in the introductory course are investigated more fully against real-world raw data that requires manipulation to achieve the “tidy” format. Projects throughout the course provide opportunities to demonstrate mastery of data science concepts, from data acquisition through the data engineering and statistical analysis portions of the lifecycle. Prerequisite(s): MDATA 500


