VNP
This document defines a data communication protocol for importing articles into the VirtualNewspaper platform from third-party publishing systems, in order to integrate text-only versions of those articles and enable speech synthesis features in the web and mobile applications.
VirtualNewspaper can import articles from publishing systems by parsing a dedicated XML feed produced by the publishing system. This XML feed must conform to the guidelines described in this technical reference sheet.
XML content and PDF files will be uploaded to a Virtualcom FTP server, following the folder hierarchy described below:
The root folder is organized as follows:
1) Its name must report the publication date and title.
2) It must contain a single multi-page PDF or multiple single-page PDF files, easily and uniquely identified and named as stated at point 1).
3) It must contain an XML folder with one XML file per article, each easily and uniquely identified and named as stated at point 1). The XML files are created as described in paragraph 7 of this technical reference sheet. The XML folder also contains all referenced images.
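The folder rules above can be checked before upload. The following is a minimal sketch, not part of the protocol itself: the function name `check_delivery_folder` is hypothetical, and it assumes the XML subfolder is literally named `XML`. It does not check the naming convention from point 1), since the exact date and title format is not specified here.

```python
from pathlib import Path


def check_delivery_folder(root: Path) -> list[str]:
    """Return a list of problems found in a delivery folder (empty if none)."""
    problems = []

    # Point 2: at least one PDF (one multi-page file or several single-page files).
    pdfs = [p for p in root.iterdir() if p.is_file() and p.suffix.lower() == ".pdf"]
    if not pdfs:
        problems.append("no PDF file in root folder")

    # Point 3: an XML subfolder holding one file per article.
    # Assumption: the subfolder is named exactly "XML".
    xml_dir = root / "XML"
    if not xml_dir.is_dir():
        problems.append("missing XML folder")
        return problems
    if not any(p.suffix.lower() == ".xml" for p in xml_dir.iterdir()):
        problems.append("XML folder contains no article files")

    return problems
```

An empty list means the basic layout is in place; any other result names what is missing.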
XML files must have the structure described below:
<article>
  <metadata>
    <article_id></article_id>
    <section></section>
  </metadata>
  <content>
    <occ></occ>
    <headline>article main title</headline>
    <subtitle></subtitle>
    <lead>article summary</lead>
    <author></author>
    <author_email></author_email>
    <body>
      <text>paragraph 1</text>
      <text>paragraph 2</text>
    </body>
    <quote></quote>
    <pullquote></pullquote>
    <nitf-table>
    </nitf-table>
    <images>
      <pic filename="" w="" h="" x="" y="">
        <pic_caption></pic_caption>
      </pic>
    </images>
    <media media-type="mediaType(video|url|image)">
      <media-storage>direct/url</media-storage>
      <media-reference>filename / url</media-reference>
      <media-caption>Description</media-caption>
    </media>
    <artMetadata>
      <artLinkedPage Number="" />
    </artMetadata>
  </content>
  <coords>
    <headCoord h="" w="" x="" y=""/>
  </coords>
</article>
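On the publishing-system side, a file with this structure could be generated along the following lines. This is only a sketch using Python's standard `xml.etree.ElementTree`; the function name `build_article` and its parameters are hypothetical, and it emits just a subset of the elements. Note that ElementTree escapes special characters instead of writing CDATA sections; whether the importer accepts escaped text in place of CDATA is an assumption to verify.

```python
import xml.etree.ElementTree as ET


def build_article(article_id, section, headline, paragraphs):
    """Build a minimal <article> document and return it as a string."""
    article = ET.Element("article")

    metadata = ET.SubElement(article, "metadata")
    ET.SubElement(metadata, "article_id").text = str(article_id)
    ET.SubElement(metadata, "section").text = section

    content = ET.SubElement(article, "content")
    ET.SubElement(content, "headline").text = headline

    # One <text> element per paragraph, as in the structure above.
    body = ET.SubElement(content, "body")
    for paragraph in paragraphs:
        ET.SubElement(body, "text").text = paragraph

    return ET.tostring(article, encoding="unicode")
```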
<article>
  <metadata>
    <article_id>1234</article_id>
    <section>News</section>
  </metadata>
  <content>
    <occ><![CDATA[The event]]></occ>
    <headline><![CDATA[Brewery seeks payout from council over link road]]></headline>
    <subtitle><![CDATA[A BREWERY claims Wrexham’s new £30 million industrial estate]]></subtitle>
    <lead><![CDATA[the pub has been left in a virtual backwater. Mr Hopkins said that ever since work started to build the link road, trade at the Gredington had been drying up to the point where it was no longer sustainable]]></lead>
    <author><![CDATA[Barth McThomas]]></author>
    <author_email><![CDATA[barth.mcthomas@me.com]]></author_email>
    <body>
      <text><![CDATA[Hydes, based in Manchester, is seeking compensation from Wrexham Council following the closure of the Gredington Arms in Llanypwll. Brewery managing director Chris Hopkins said the Gredington used to be on the main route to the industrial estate, with a ready supply of customers stopping off for refreshment. But now the link route has diverted traffic away, the pub has been left in a virtual backwater. Mr Hopkins said that ever since work started to build the link road, trade at the Gredington had been drying up to the point where it was no longer sustainable. “We didn’t take the decision to close down the Gredington lightly, it has been with us for many years and doing steady business,” said Mr Hopkins. “We were concerned even before the building of the link road. But ever since construction work started on the scheme, the level of custom has been hit.]]></text>
    </body>
    <quote><![CDATA[There are many local nooks that fishing fans may want to keep a special eye on.]]></quote>
    <pullquote><![CDATA[As can be seen from the table below, the shores of Oceanview again present the brightest spots for fishermen and sandcastle-builders alike.]]></pullquote>
    <nitf-table>
today | tide | tomorrow | next day | |
---|---|---|---|---|
beach | high | low | in | out |
Sunset | 30 | 14 | 28 | 11 |
    </nitf-table>
    <images>
      <pic filename="23w1p01GreddingtonArmsINT.jpg" w="10" h="15" x="30" y="40">
        <pic_caption><![CDATA[The Gredington Arms in Llanypwll, Wrexham]]></pic_caption>
      </pic>
    </images>
    <media media-type="image">
      <media-reference>filename.jpg</media-reference>
      <media-caption>Photo: Ira Magaziner</media-caption>
    </media>
    <media media-type="video">
      <media-storage>direct</media-storage>
      <media-reference>filename.mp4</media-reference>
      <media-caption>Local video</media-caption>
    </media>
    <media media-type="video">
      <media-storage>url</media-storage>
      <media-reference>http://www.youtube.com/x</media-reference>
      <media-caption>Generic video</media-caption>
    </media>
    <media media-type="url">
      <media-reference>http://www.google.it</media-reference>
      <media-caption>Google</media-caption>
    </media>
    <artMetadata>
      <artLinkedPage Number="27" />
    </artMetadata>
  </content>
  <coords>
    <headCoord h="15" w="98" x="0" y="0"/>
  </coords>
</article>
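Since the purpose of the import is a text-only version suitable for speech synthesis, a consumer might reduce an article like the one above to plain text. The sketch below is illustrative only and does not describe how VirtualNewspaper itself processes the files; the function name `article_plain_text` and the choice of fields (headline, subtitle, lead, body paragraphs) are assumptions. The standard parser returns CDATA content as ordinary element text, so no special handling is needed.

```python
import xml.etree.ElementTree as ET


def article_plain_text(xml_text: str) -> str:
    """Return headline, subtitle, lead and body paragraphs, one per line."""
    content = ET.fromstring(xml_text).find("content")
    if content is None:
        raise ValueError("article has no <content> element")

    parts = []
    for tag in ("headline", "subtitle", "lead"):
        value = content.findtext(tag)  # None if absent, "" if empty
        if value and value.strip():
            parts.append(" ".join(value.split()))  # collapse whitespace runs

    for node in content.findall("body/text"):
        if node.text and node.text.strip():
            parts.append(" ".join(node.text.split()))

    return "\n".join(parts)
```

Missing or empty elements are skipped, so the same function works on articles that omit the subtitle or lead.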
The <coords> tag is not mandatory.
Coordinates must always be expressed as percentages: both position (x, y) and size (w, h) are measured from the top-left corner of the page.
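As an illustration of the percentage convention, the following sketch converts a box to pixels for a rendered page. The function name `percent_box_to_pixels` is hypothetical, and it assumes that x and w are percentages of the page width while y and h are percentages of the page height, which the text above implies but does not state explicitly.

```python
def percent_box_to_pixels(x, y, w, h, page_width_px, page_height_px):
    """Convert a percentage box to (left, top, width, height) in pixels.

    Assumption: x and w are relative to page width, y and h to page height,
    with the origin at the top-left corner of the page.
    """
    left = round(page_width_px * x / 100)
    top = round(page_height_px * y / 100)
    width = round(page_width_px * w / 100)
    height = round(page_height_px * h / 100)
    return left, top, width, height
```

For the `headCoord` values in the example (h="15" w="98" x="0" y="0") on a 1000 by 2000 pixel page, this yields a box at the top-left corner that is 980 pixels wide and 300 pixels high.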