Rodrigues, A; Silva, C; Borges, PVK; Silva, S; Dutra, I;
2015 IEEE International Conference on Smart City/SocialCom/SustainCom, SmartCity 2015, Chengdu, China, December 19-21, 2015
Statistical data analysis methods are well known for their difficulty in handling large numbers of instances or parameters. This is most noticeable in the presence of "big data", i.e., data that are heterogeneous and come from several sources, which makes their volume increase very rapidly. In this paper, we study popular, well-known statistical functions generally applied to data analysis and assess their performance using our own implementation (DataIP), MATLAB, and R. We show that DataIP outperforms MATLAB and R by several orders of magnitude and that the design and implementation of these functions need to be rethought to adapt to today's data challenges. © 2015 IEEE.
Rodrigues, AV; Jorge, A; Dutra, I;