Table of Contents Author Guidelines Submit a Manuscript
The Scientific World Journal
Volume 2014 (2014), Article ID 506740, 18 pages
http://dx.doi.org/10.1155/2014/506740
Research Article

Using Data Crawlers and Semantic Web to Build Financial XBRL Data Generators: The SONAR Extension Approach

1Department of Informatics and Systems, University of Murcia, Espinardo, 30100 Murcia, Spain
2Bioinformatics at Centre for Plant Biotechnology and Genomics UPM-INIA, Polytechnic University of Madrid, Pozuelo de Alarcón, 28223 Madrid, Spain
3Computer Science Department, Carlos III University of Madrid, Leganés, 28911 Madrid, Spain

Received 26 August 2013; Accepted 24 October 2013; Published 23 January 2014

Academic Editors: H. Cheng and E. Denti

Copyright © 2014 Miguel Ángel Rodríguez-García et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. R. Pinsker and S. Li, “Costs and benefits of XBRL adoption: early evidence,” Communications of the ACM, vol. 51, no. 3, pp. 47–50, 2008. View at Publisher · View at Google Scholar · View at Scopus
  2. O. Abdelsalam and A. El-Masry, “The impact of board independence and ownership structure on the timeliness of corporate internet reporting of Irish-listed companies,” Managerial Finance, vol. 34, no. 12, pp. 907–918, 2008. View at Google Scholar
  3. A. R. Rahman, T. M. Tay, B. T. Ong, and S. Cai, “Quarterly reporting in a voluntary disclosure environment: its benefits, drawbacks and determinants,” International Journal of Accounting, vol. 42, no. 4, pp. 416–442, 2007. View at Publisher · View at Google Scholar · View at Scopus
  4. R. Debreceny and A. Rahman, “Firm-specific determinants of continuous corporate disclosures,” International Journal of Accounting, vol. 40, no. 3, pp. 249–278, 2005. View at Publisher · View at Google Scholar · View at Scopus
  5. M. J. Jones and J. Z. Xiao, “Financial reporting on the Internet by 2010: a consensus view,” Accounting Forum, vol. 28, no. 3, pp. 237–263, 2004. View at Publisher · View at Google Scholar · View at Scopus
  6. R. Debreceny, G. L. Gray, and A. Rahman, “The determinants of internet financial reporting,” Journal of Accounting and Public Policy, vol. 21, no. 4-5, pp. 371–394, 2002. View at Publisher · View at Google Scholar · View at Scopus
  7. A. S. Kelton and Y.-W. Yang, “The impact of corporate governance on Internet financial reporting,” Journal of Accounting and Public Policy, vol. 27, no. 1, pp. 62–87, 2008. View at Publisher · View at Google Scholar · View at Scopus
  8. J. L. Gandía, “Determinants of internet-based corporate governance disclosure by Spanish listed companies,” Online Information Review, vol. 32, no. 6, pp. 791–817, 2008. View at Publisher · View at Google Scholar · View at Scopus
  9. M. Ettredge and J. Gerdes Jr., “Timeliness of investor relations data at corporate web sites,” Communications of the ACM, vol. 48, no. 1, pp. 95–100, 2005. View at Publisher · View at Google Scholar · View at Scopus
  10. J. Kingston, B. Schafer, and W. Vandenberghe, “Towards a financial fraud ontology: a legal modelling approach,” Artificial Intelligence and Law, vol. 12, no. 4, pp. 419–446, 2004. View at Publisher · View at Google Scholar · View at Scopus
  11. M. Ettredge, V. J. Richardson, and S. Scholz, “Dissemination of information for investors at corporate Web sites,” Journal of Accounting and Public Policy, vol. 21, no. 4-5, pp. 357–369, 2002. View at Publisher · View at Google Scholar · View at Scopus
  12. R. Pinsker, S. Gara, and K. Karim, “XBRL usage: a socio-economic perspective,” Review of Business Information Systems, vol. 9, no. 4, pp. 59–72, 2005. View at Google Scholar
  13. R. Debreceny and G. L. Gray, “The production and use of semantically rich accounting reports on the Internet: XML and XBRL,” International Journal of Accounting Information Systems, vol. 2, no. 1, pp. 47–74, 2001. View at Publisher · View at Google Scholar · View at Scopus
  14. A. Kambil, “What is your Web 5.0 strategy?” Journal of Business Strategy, vol. 29, no. 6, pp. 56–58, 2008. View at Publisher · View at Google Scholar · View at Scopus
  15. R. A. Weber, “XML, XBRL, and the future of business and business reporting,” in Trust and Data Assurances in Capital Markets: The Role of Technology Solutions, S. J. Roohani, Ed., Bryant College, Smithfield, Va, USA, 2003. View at Google Scholar
  16. F. D. Hodge, J. J. Kennedy, and L. A. Maines, “Does search-facilitating technology improve the transparency of financial reporting?” Accounting Review, vol. 79, no. 3, pp. 687–703, 2004. View at Google Scholar · View at Scopus
  17. O. Duangploy and D. Gay, “International harmonization impact compared: illustration of United States and Japan Financial Statement Ratio Analysis,” Journal of American Academy of Business, vol. 6, no. 1, pp. 225–230, 2005. View at Google Scholar
  18. R. F. Premuroso and S. Bhattacharya, “Do early and voluntary filers of financial information in XBRL format signal superior corporate governance and operating performance?” International Journal of Accounting Information Systems, vol. 9, no. 1, pp. 1–20, 2008. View at Publisher · View at Google Scholar · View at Scopus
  19. T. Berners-Lee, J. Hendler, and O. Lassila, “The semantic web,” Scientific American, vol. 284, no. 5, pp. 34–43, 2001. View at Publisher · View at Google Scholar · View at Scopus
  20. R. Studer, V. R. Benjamins, and D. Fensel, “Knowledge engineering: principles and methods,” Data and Knowledge Engineering, vol. 25, no. 1-2, pp. 161–197, 1998. View at Google Scholar · View at Scopus
  21. H.-C. Yang, “Automatic generation of semantically enriched web pages by a text mining approach,” Expert Systems with Applications, vol. 36, no. 6, pp. 9709–9718, 2009. View at Publisher · View at Google Scholar · View at Scopus
  22. S. B. Park, S. S. Kim, S. Oh, Z. Zeong, H. Lee, and S. R. Park, “Target concept selection by property overlap in ontology population,” International Journal of Computer Science, vol. 3, no. 1, pp. 14–18, 2008. View at Google Scholar
  23. T. C. Du, F. Li, and I. King, “Managing knowledge on the Web: extracting ontology from HTML Web,” Decision Support Systems, vol. 47, no. 4, pp. 319–331, 2009. View at Publisher · View at Google Scholar · View at Scopus
  24. T. R. Gruber, “A translation approach to portable ontology specifications,” Knowledge Acquisition, vol. 5, no. 2, pp. 199–220, 1993. View at Publisher · View at Google Scholar · View at Scopus
  25. S. Staab, R. Studer, H.-P. Schnurr, and Y. Sure, “Knowledge processes and ontologies,” IEEE Intelligent Systems and Their Applications, vol. 16, no. 1, pp. 26–34, 2001. View at Publisher · View at Google Scholar · View at Scopus
  26. N. F. Noy and M. A. Musen, “The PROMPT suite: interactive tools for ontology merging and mapping,” International Journal of Human Computer Studies, vol. 59, no. 6, pp. 983–1024, 2003. View at Publisher · View at Google Scholar · View at Scopus
  27. T. Wahl and G. Sindre, “A survey of development methods for semantic web service systems,” International Journal of Information Systems in the Service Sector, vol. 1, no. 2, pp. 1–16, 2009. View at Google Scholar
  28. M. Shamsfard and A. A. Barforoush, “Learning ontologies from natural language texts,” International Journal of Human Computer Studies, vol. 60, no. 1, pp. 17–63, 2004. View at Publisher · View at Google Scholar · View at Scopus
  29. A. Maedche and S. Staab, “Ontology learning,” in Handbook on Ontologies, International Handbooks on Information Systems, S. Staab and R. Studer, Eds., pp. 173–190, Springer, 2004. View at Google Scholar
  30. J. M. Ruiz-Martínez, R. Valencia-García, R. Martínez-Béjar, and A. Hoffmann, “BioOntoVerb: a top level ontology based framework to populate biomedical ontologies from texts,” Knowledge Based Systems, vol. 36, pp. 68–80, 2012. View at Google Scholar
  31. T. Sugibuchi and Y. Tanaka, “Interactive web-wrapper construction for extracting relational information from web documents,” in Proceedings of the 14th International World Wide Web Conference (WWW '05), pp. 968–969, May 2005. View at Publisher · View at Google Scholar · View at Scopus
  32. A. Pan, J. Raposo, M. Álvarez, V. Carneiro, and F. Bellas, “Automatically maintaining navigation sequences for querying semi-structured web sources,” Data and Knowledge Engineering, vol. 63, no. 3, pp. 793–808, 2007. View at Publisher · View at Google Scholar · View at Scopus
  33. V. I. Levenshtein, “Binary codes capable of correcting deletions, insertions, and reversals,” Soviet Physics-Doklady, vol. 10, no. 8, pp. 707–710, 1966. View at Google Scholar
  34. H. Namgoong and H.-G. Kim, “Ontology-based controlled natural language editor using cfg with lexical dependency,” in The Semantic Web, vol. 4825, pp. 353–366, 2007. View at Google Scholar
  35. H.-G. Kim, B.-H. Ha, J.-I. Lee, and M.-K. Kim, “A multi-layered application for the gross description using Semantic Web technology,” International Journal of Medical Informatics, vol. 74, no. 5, pp. 399–407, 2005. View at Publisher · View at Google Scholar · View at Scopus
  36. A. Bernstein and E. Kaufmann, “Gino: a guided input natural language ontology editor,” in Proceedings of the 5th International Semantic Web Conference (ISWC '06), vol. 4273 of Lecture Notes in Computer Science, pp. 144–157, 2006.
  37. B. McBride, “Jena: a semantic web toolkit,” IEEE Internet Computing, vol. 6, no. 6, pp. 55–58, 2002. View at Publisher · View at Google Scholar · View at Scopus
  38. C. Wang, M. Xiong, Q. Zhou, and Y. Yu, “PANTO: a portable natural language interface to ontologies,” in Proceedings of the 4th European Semantic Web Conference (ESWC '07), vol. 4519 of Lecture Notes in Computer Science, pp. 473–487, June 2007. View at Scopus
  39. E. Kaufmann, A. Bernstein, and L. Fischer, “NLP-reduce: a “naive” butdomain-independent natural language interface for querying ontologies,” in Proceedings of the 4th European Semantic Web Conference, 2007.
  40. E. Sirin, B. Parsia, B. C. Grau, A. Kalyanpur, and Y. Katz, “Pellet: a practical OWL-DL reasoner,” Web Semantics, vol. 5, no. 2, pp. 51–53, 2007. View at Publisher · View at Google Scholar · View at Scopus
  41. E. Kaufmann and A. Bernstein, “How useful are natural language interfaces to the semantic web for casual end-users?” in Proceedings of the 6th International Semantic Web Conference and the 2nd Asian Semantic Web Conference (ISWC '07; ASWC '07), vol. 4825, pp. 281–294, 2007.
  42. P. R. Smart, “Controlled natural languages and the semantic web,” Technical Report ITA/P12/SemWebCNL, School of Electronics and Computer Science, University of Southampton, 2008. View at Google Scholar
  43. S. Arthur and S. M. Sheffrin, Economics Principles in Action, Prentice Hall, Upper Saddle River, NJ, USA, 2003.
  44. D. D. Hawley, “Artificial neural systems: a new tool for financial decision-making,” Financial Analysts Journal, vol. 46, no. 6, pp. 63–72, 1990. View at Google Scholar
  45. F. E. H. Tay and L. Shen, “Economic and financial prediction using rough sets model,” European Journal of Operational Research, vol. 141, no. 3, pp. 641–659, 2002. View at Publisher · View at Google Scholar · View at Scopus
  46. J. M. Gomez, F. Garcia-Sanchez, R. Valencia-Garcia, I. Toma, and C. Garcia-Moreno, “SONAR: a semantically empowered financial search engine,” in Proceedings of the International Work Conference on the Interplay between Natural and Artificial Computation, vol. 5601 of Lecture notes in computer science, pp. 405–414, 2009.
  47. H. H. Zhu and S. Madnick, “Semantic Integration Approach to Efficient Business Data Supply Chain: Integration Approach to Inter-Operable XBRL,” MIT Sloan School of Management Research Paper Series, 2007. View at Google Scholar
  48. R. Garcia and R. Gil, “Facilitating business interoperability from the semantic web,” in Business Information Systems, pp. 220–232, 2007. View at Google Scholar
  49. T. Declerck and H. U. Krieger, “Translating XBRL into description logic: an approach using protege, sesame and OWL,” in Business Information Systems, 2006. View at Google Scholar
  50. R. García and R. Gil, “Triplificating and linking XBRL financial data,” in Proceedings of the 6th International Conference on Semantic Systems (I-SEMANTICS '10), September 2010. View at Publisher · View at Google Scholar · View at Scopus
  51. I. García-Manotas, E. Lupiani, F. García-Sánchez, and R. Valencia-García, “Populating knowledge based decision support systems,” International Journal of Decision Support Systems, vol. 2, no. 1, pp. 1–20, 2010. View at Google Scholar
  52. P. Castells, B. Foncillas, R. Lara, M. Rico, and J. L. Alonso, “Semantic web technologies for economic and financial information management,” in Proceedings of the 1st European Semantic Web Symposium (ESWS '04), pp. 473–487, 2004.
  53. M. S. Fox, M. Barbuceanu, M. Gruninger, and J. Lin, “An organizational ontology for enterprise modelling,” in Simulating Organizations: Computational Models of Institutions and Groups, pp. 131–152, MIT Press, Cambridge, Mass, USA, 1998. View at Google Scholar
  54. C. Partridge and M. Stefanova, “A Synthesis of State of the Art Enterprise Ontologies, Lessons Learned,” The BORO Program, LADSEB CNR., 2001.
  55. A. S. Losada, J. L. Bas, S. Bellido, J. Contreras, R. Benjamins, and J. M. Gomez, “WP10: Case Study eBanking D10. 7 Financial Ontology. Data, Information and Process Integration with Semantic Web Services,” Tech. Rep. FP6-507483, 2005. View at Google Scholar
  56. XBRL International, “XBRL: eXtensible Business Reporting Language,” 2009, http://www.xbrl.org.
  57. R. Valencia-García, F. García-Sánchez, D. Castellanos-Nieves, and J. T. Fernández-Breis, “OWLPath: an OWL ontology-guided query editor,” IEEE Transactions on Systems, Man, and Cybernetics A, vol. 41, no. 1, pp. 121–136, 2011. View at Publisher · View at Google Scholar · View at Scopus
  58. “IPP Taxonomy,” 2011, http://www.xbrl.es/informacion/ipp.html.
  59. S. Méndez, J. Labra, J. De-Andrés, and P. Ordoñez, “Analysis of XBRL documents containing accounting information of listed firms using Semantic Web Technologies,” in Metadata and Semantics, M. A. Sicilia and M. D. Lytras, Eds., pp. 375–381, 2009. View at Google Scholar
  60. Y. Qi, K. Selçuk Candan, J. Tatemura, S. Chen, and F. Liao, “Supporting OLAP operations over imperfectly integrated taxonomies,” in Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD '08), pp. 875–888, June 2008. View at Publisher · View at Google Scholar · View at Scopus
  61. C. K. Selçuk, M. Cataldi, M. L. Sapino, and C. Schifanella, “Structure- and extension-informed taxonomy alignment,” in Proceedings of the 4th International VLDB Workshop on Ontology-based Techniques for DataBases in Information Systems and Knowledge Systems (ODBIS '08), 2008.
  62. C. W. Cleverdon, J. Mills, and E. M. Keen, Factors Determining the Performance of Indexing Systems, College of Aeronautics, Cranfield, UK, 1966.
  63. C. J. van Rijsbergen, Information Retrieval, Butterworth-Heinemann, Boston, Mass, USA, 1979.
  64. Á. García-Crespo, R. Colomo-Palacios, J. M. Gómez-Berbís, and B. Ruiz-Mezcua, “SEMO: a framework for customer social networks analysis based on semantics,” Journal of Information Technology, vol. 25, no. 2, pp. 178–188, 2010. View at Publisher · View at Google Scholar · View at Scopus
  65. K. Gao and S. Li, “The cooperation model for multi-agents and the identification on replicated collections for web crawler,” International Journal of Modelling, Identification and Control, vol. 11, no. 3-4, pp. 224–231, 2010. View at Publisher · View at Google Scholar · View at Scopus
  66. D. H.-L. Goh, A. Chua, C. S. Lee, and K. Razikin, “Resource discovery through social tagging: a classification and content analytic approach,” Online Information Review, vol. 33, no. 3, pp. 568–583, 2009. View at Publisher · View at Google Scholar · View at Scopus
  67. C. C. Chen and Y.-D. Tseng, “Quality evaluation of product reviews using an information quality framework,” Decision Support Systems, vol. 50, no. 4, pp. 755–768, 2011. View at Publisher · View at Google Scholar · View at Scopus
  68. D. Jannach, K. Shchekotykhin, and G. Friedrich, “Automated ontology instantiation from tabular web sources: the allRight system,” Journal of Web Semantics, vol. 7, no. 3, pp. 136–153, 2009. View at Publisher · View at Google Scholar · View at Scopus
  69. S.-H. Lin, K.-P. Chu, and C.-M. Chiu, “Automatic sitemaps generation: exploring website structures using block extraction and hyperlink analysis,” Expert Systems with Applications, vol. 38, no. 4, pp. 3944–3958, 2011. View at Publisher · View at Google Scholar · View at Scopus