Functional Architecture of integrated framework for Facet-based Data Collection and Analysis
Keywords:
Data Analysis, Integrated Framework, Intelligent Service, Text Data Collection, Web Crawling.Abstract
We present in this paper an integrated framework for collection and analysis of Facet-based text data. The integrated framework consists of four components: (1) user interface, (2) web crawler, (3) data analyzer, and (4) database (DB). User interface is used to set input Facet and option values for web crawling and text data analysis using a graphical user interface (GUI). In fact, it offers outcomes of research by data visualization. The web crawler collects text data from articles posted on the web based on input Facets. The data analyzer classifies papers in "relevant articles" (i.e., word sets to be included on such posts) and "nonrelevant articles" with predefined information. It then analyzes the text data of the relevant articles and visualizes the results of the data analysis. Ultimately, the DB holds the generated text information, the predefined user-defined expertise and the outcomes of data analysis and data visualization. We verify the feasibility of an integrated framework by means of proof of concept (PoC) prototyping. The experimental results show that the implemented prototype reliably collects and analyzes the text data of the articles.
Downloads
References
C.Dobre and F. Xhafa: Future Gener. Comp. Syst. 37 (2014) 267.
W. Raghupathi and V. Raghupathi: Health Inf. Sci. Syst. 2 (2014).
Z. Khan, A. Anjum, and S. L. Kiani: Proc. 2013 IEEE/ACM sixth Int. Conf. Utility and Cloud Computing (IEEE, 2013) 381.
A. Sheth, C. Henson, and S. S. Sahoo: IEEE Internet Comput. 12 (2008) 78.
S. Landset, T. M. Khoshgoftaar, A. N. Richter, and T. Hasanin:
J. Enormous Data 2 (2015) 24.
F. Morandat, B. Slope, L. Osvald, and J. Vitek: Proc. European Conf. Item Oriented Programming (Springer, 2012) 104.
B. C. D. S. Oliveira and J. Gibbons: J. Funct. Program. 20 (2010) 303.
A. Mesbah, A. V. Deursen, and S. Lenselink: ACM Trans. Web 6 (2012) 1.
Y. Zhang: IEEE Trans. Serv. Comput. 9 (2016) 786.
S. Wang, C. Zhang, and D. Li: Proc. Int. Conf. Modern IoT Technologies and Applications (Springer, 2016)
R Foundation: https://www.r-project.org/(got to July 2017).
Oracle: http://www.oracle.com/technetwork/java/javase/downloads/jdk8-down loads-2133151.html (got toJuly 2017). ZDNet: http://www.zdnet.com/(got to July 2017)
Downloads
Published
How to Cite
Issue
Section
License
You are free to:
- Share — copy and redistribute the material in any medium or format for any purpose, even commercially.
- Adapt — remix, transform, and build upon the material for any purpose, even commercially.
- The licensor cannot revoke these freedoms as long as you follow the license terms.
Under the following terms:
- Attribution — You must give appropriate credit , provide a link to the license, and indicate if changes were made . You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
- No additional restrictions — You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.
Notices:
You do not have to comply with the license for elements of the material in the public domain or where your use is permitted by an applicable exception or limitation .
No warranties are given. The license may not give you all of the permissions necessary for your intended use. For example, other rights such as publicity, privacy, or moral rights may limit how you use the material.