Increase in use of stand-alone systems for recording data has made data harvesting across several fields easier and faster. However, raw data collected from such systems need to be manipulated and processed to enable meaningful analysis. Although data are readily available, one major issue concerning analysts and scientists is collation of data from various sources. Without a standard data format, scientists and analysts are required to put in resources to bring in data from multiple sources together. The problem is aggravated when a particular data source changes its data format.
Comments
Technical Report: UTEP-CS-11-19