EUREKA XML CONVERSION


Outsource XML Services to Eureka Science

In the current competitive corporate world, companies dealing with a humongous amount of data often require transforming their data from one format to another desired one. Data conversion plays a significant role in converting data in a different format than now. The key to the success of any organization lies in how that organization manages the presentation of information. Eureka Data Services is a pioneer in providing XML data conversion, having expertise in NISO Journal Article Tag Suite (JATS), Archiving and Interchange Tag Suite, or the NLM DTDs. We are adept at converting STM articles to NLM Journal Publishing Tag Set versions 2.3 and 3.0.


WHAT IS XML?

The Extensible Markup Language (XML) is a markup language that is used to store and transport data. It is a set of rules for generating custom markup languages. The basic function of XML is to conveniently create information formats and electronically share structured data across various information systems through the internet. Moreover, it is used for the encoding and serialization of data. XML is an open standard and is highly recommended by the W3 Consortium.


WHAT IS XML DATA CONVERSION?

XML Conversion is the practice of converting data from a specific format into an XML format through a set of rules that are defined by the XML language. XMX data is called self-describing or self-defining, which signifies that the structure of data is stored within the data which rules out the need to create a structure from scratch to store data. Conversion of data into XML format makes the data more presentable and user-friendly. The XML data conversion process transforms the data into XML format, making it highly reusable and enabling a more efficient information delivery system.


XML SERVICES OFFERED

  • HTML to XML
  • MS-Word to XML
  • PDF to XML
  • XML tagging services

NLM JOURNAL PUBLISHING TAG SET VERSION 2.3, 3.0 AND JATS V1.3

The Journal Publishing Tag Set outlines elements and attributes that define the content of the articles of journals, besides some other text-based material present in journals, such as letters, editorials, and book and product reviews. This Tag Set was transcribed to describe both the metadata for a journal article and the content of the article. However, it can also be used to describe only the article header metadata. This is a reasonably prescriptive tag set for the archives to standardize and control the content of journal material. Although designed for biomedical journals, this tag set should adequately describe STM journals and technical journals in any field. The Journal Publishing Document Type Definition (DTD) is the initial constraint language for the tag set. The DTD comprises a few Publishing-DTD-specific modules and uses (by reference) the base modules of the Journal Archiving and Interchange Suite. The components of that Suite were developed to create XML applications through which data on health-related disciplines could be shared and reused electronically. The Suite can be (and has been) used to construct many tag sets in addition to this one. Although the full Suite was developed to support electronic production, the structures sufficiently cater to print production as well. This being a Publishing Tag Set optimized for regularizing an archive, rather than a strict preservationist tag set, it is quite restricted with just a few elements and limited choices in many contexts as compared to the full Journal Archiving and Interchange Tag Set. The Archiving Tag Set offers several ways to express the same information, the aim for Publishing was to adopt any single route to tag the same content. It was not the intent to restrict the expressive power licensed by this tag set, but rather to limit the futile choices that a full interchange tag set needs to make to accommodate conversion from a wide variety of formats. The philosophy for the Archiving Tag Set was to accept varied forms of many structures unchanged. On the contrary, the philosophy for the Publishing Tag Set was to accept a wide variety of structures and to regularize those that matter to the archive. This Tag Library describes the Publishing Tag Set and the Journal Archiving and Interchange Suite by providing:


  • • An outline of the design principles for the whole modular Archiving and Interchange Suite;

  • • A description of how to read and use this Tag Library;

  • • An introduction to this Journal Publishing Tag Set;

  • • An account of how to build new DTDs from the modules in the Suite;

  • • Extensive reference sections describing the elements, attributes, and Parameter Entities defined in the tag set (This part of the Tag Library also provides individual definitions and tagged examples for most of the elements and attributes defined in the full Suite.);

  • • Hierarchical diagrams highlighting the structures in the Journal Publishing Tag Set;

  • • Tables and appendices for reference on the fine points of the Tag Set;

  • • Reading copies of the DTD modules of the Journal Publishing DTD as well as the modules that make up the full modular Suite.

  • • A full article sample, both as a PDF file showing the format and as an XML file valid to this tag set.

ARTICLE STRUCTURE IN NLM JOURNAL PUBLISHING DTD



FRONT MATTER OF AN ARTICLE



BODY OF AN ARTICLE



BACK MATTER OF AN ARTICLE



FLOATS WRAPPER STRUCTURE



SUB-ARTICLE STRUCTURE



RESPONSE STRUCTURE