Metadata Extraction from References of Different Styles
Metadata extraction is the process of describing extrinsic and intrinsic qualities of the resource such as document, image, video, including getting data from references. References form an essential part of electronic scholarly publications. A reference is the way of giving acknowledgment to individuals for their creative and intellectual works that one utilized in his or her research work. It can also be used to locate particular sources and combat plagiarism. A reference style dictates the information necessary for a reference and how the information is ordered. Accurate and automatic reference metadata generation provides scalability, interoperability and usability for digital libraries of both public and private institution and their collections. Accurate reference metadata extraction becomes an intriguing task to researchers who want to collect data of scientific publications; therefore, this research work proposes a metadata extraction from references of different styles with the use of regular expression. This work accurately extract metadata such as author, title of article, volume, year of publication and institution from references of different styles limiting it to six referencing style.
. B.A. Ojokoh, “Rule-based metadata extraction for heterogeneous references”, Oriental Journal of Computer Science and Technology 2 (2009).
. Houssam Nassif, Ryan Woods, “Information Extraction for Clinical Data Mining: A Mammography Case Study”, in 2009 IEEE International Conference on Data Mining Workshops (ICDMW). FL, USA, pp.37-42, December 2009.
. Bin Zhou, Yan Jia, “A Distributed Text Mining System for Online Web Textual Data Analysis”, in Cyber-Enabled Distributed Computing and Knowledge Discovery (CyberC). Huangshan, China, pp.1-4, October 2010.
. Sushain Pandit, Ontology-guided extraction of structured information from unstructured text: Identifying and capturing complex relationships, Ames, Iowa: Iowa State University, 2010.
. D. Carrell, D. Miglioretti, “Coding free text radiology reports using the cancer text information extraction system (caTIES)”, In American Medical Informatics Association Annual Symposium Proceedings (AMIA). Rochester, USA, pp.889-893, September 2007.
. L. Rokach, O. Maimon, “Information retrieval system for medical narrative reports”, In Proc. of the 6th International Conference on Flexible Query Answering Systems (FQAS). Lyon, France, pp.217–228, June 2004.
. D. Gupta, B. Morris, T. Catapano, G. Sautter, A new approach towards bibliographic reference identification, parsing and inline citation matching, in: Proceedings of the International Conference of Contemporary Computing, India, 2009, pp. 93–102.
. Bolanle Ojokoh, Ming Zhang, Jian Tang, A trigram hidden Markov model for metadata extraction from heterogeneous references, Information Sciences 181 (2011) 1538–1551.
. S.H. Papavlasopoulos, M.S. Poulos, N.T. Korfiatis, G.D. Bokos, A non linear index to evaluate a journal’s scientific impact, Information Sciences 180 (2010) 2156–2175
Copyright (c) 2021 International Journal of Computer (IJC)
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Authors who submit papers with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).
- By submitting the processing fee, it is understood that the author has agreed to our terms and conditions which may change from time to time without any notice.
- It should be clear for authors that the Editor In Chief is responsible for the final decision about the submitted papers; have the right to accept\reject any paper. The Editor In Chief will choose any option from the following to review the submitted papers:A. send the paper to two reviewers, if the results were negative by one reviewer and positive by the other one; then the editor may send the paper for third reviewer or he take immediately the final decision by accepting\rejecting the paper. The Editor In Chief will ask the selected reviewers to present the results within 7 working days, if they were unable to complete the review within the agreed period then the editor have the right to resend the papers for new reviewers using the same procedure. If the Editor In Chief was not able to find suitable reviewers for certain papers then he have the right to reject the paper.
- Author will take the responsibility what so ever if any copyright infringement or any other violation of any law is done by publishing the research work by the author
- Before publishing, author must check whether this journal is accepted by his employer, or any authority he intends to submit his research work. we will not be responsible in this matter.
- If at any time, due to any legal reason, if the journal stops accepting manuscripts or could not publish already accepted manuscripts, we will have the right to cancel all or any one of the manuscripts without any compensation or returning back any kind of processing cost.
- The cost covered in the publication fees is only for online publication of a single manuscript.