IPTC are working on an ontology known as rNews, which aims to standardise (and encourage the adoption of) RDFa in news articles.
This is a very, very good idea – it should allow for better content discovery, new ways to aggregate news stories about people, places or subjects and generally allow computers to help people process some of the structured information behind a story.
rNews is still in draft. At the time of writing the published spec is at version 0.1, but there are clearly ambitions to build on this work and it will be interesting to see where it goes.
Although I’m sure much of this has been thought about before, I thought I would jot down my initial thoughts on this early draft.
More URIs please
The current spec makes extensive use of xsd:string and xsd:double to assign attributes to a class. For example, the Location Class includes attributes for longitude, latitude and altitude but no URIs for places.
Using URIs to name places (and people, subjects, organisations etc.) would allow for much more interesting things to be done with the data.
It would make it easier to aggregate content from more than one news outlet and generally link things together by location, person and area of interest.
There’s obviously an issue here – there needs to be a good source of URIs for places – but in reality there are lots of candidates out there, from DBpedia to GeoNames.
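To make the aggregation point concrete, here is a minimal sketch in Python. The article URLs and the dictionary layout are made up for illustration (this is not rNews syntax), and the DBpedia URIs are only there as examples of the kind of identifier I mean.

```python
from collections import defaultdict

# Hypothetical articles from two outlets. Only the idea matters here:
# each article names its place with a URI as well as a text label.
articles = [
    {"url": "http://news.example.org/a1", "place_label": "Paris",
     "place_uri": "http://dbpedia.org/resource/Paris"},
    {"url": "http://other.example.com/b7", "place_label": "Paris, France",
     "place_uri": "http://dbpedia.org/resource/Paris"},
    {"url": "http://other.example.com/b9", "place_label": "Paris",
     "place_uri": "http://dbpedia.org/resource/Paris,_Texas"},
]

def group_by(items, key):
    """Group article URLs by the value found under `key`."""
    groups = defaultdict(list)
    for item in items:
        groups[item[key]].append(item["url"])
    return dict(groups)

by_label = group_by(articles, "place_label")  # strings: ambiguous
by_uri = group_by(articles, "place_uri")      # URIs: unambiguous
```

Grouping on the label lumps the French and Texan stories together and splits the two French ones apart; grouping on the URI gets both right without any string matching.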
Greater reuse of existing vocabularies
There are existing vocabularies that describe some of the classes defined in rNews – notably FOAF and Dublin Core.
I would prefer rNews to reuse those vocabularies or at least link (owl:sameAs) to them.
I’m not a fan of tags
I don’t really like “tagging”: it lacks semantics and is extremely ambiguous.
If I tag a news story am I claiming it’s primarily about that thing, features that thing, also about that thing, what? And whatever you think it means I guarantee I can find someone else who disagrees!
I would rather see more defined predicates such as primarilyAbout etc. I recognise this would add a bit of complexity but it would also increase the utility of the vocabulary.
If the intention is to aid discoverability through categorisation then use SKOS.
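As a rough illustration of what more defined predicates buy you, here is a small Python sketch that treats statements as plain (subject, predicate, object) tuples. The namespace and the `mentions` predicate are my own inventions for the example; only `primarilyAbout` comes from the suggestion above, and neither is in the rNews draft.

```python
EX = "http://example.org/vocab#"  # hypothetical namespace, not part of rNews

ICELAND = "http://dbpedia.org/resource/Iceland"
VOLCANO = "http://dbpedia.org/resource/Volcano"

# Two hypothetical stories described with explicit predicates instead of tags.
triples = [
    ("http://news.example.org/a1", EX + "primarilyAbout", VOLCANO),
    ("http://news.example.org/a1", EX + "mentions", ICELAND),
    ("http://news.example.org/a2", EX + "primarilyAbout", ICELAND),
]

def stories(predicate, topic_uri):
    """Return the stories linked to `topic_uri` by the given predicate."""
    return [s for s, p, o in triples if p == EX + predicate and o == topic_uri]

about_iceland = stories("primarilyAbout", ICELAND)
mentioning_iceland = stories("mentions", ICELAND)
```

With a bare "Iceland" tag on both stories there would be no way to tell the main story from the passing mention; with two predicates the distinction is a one-line query.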
Explicit predicates for source materials
I think it’s really important to explicitly link to source material, especially for science and medicine (it’s why Nature News has always done so).
A simple set of predicates for the DOI, abstract URI, scientist/researcher of the original research and/or a URI for the raw data should suffice.
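Something along these lines would do, sketched here in Python. The predicate names, the namespace and the placeholder DOI are all assumptions of mine rather than anything in the rNews draft; the only real-world detail is that a DOI can be written as a URI via the doi.org resolver.

```python
EX = "http://example.org/vocab#"  # hypothetical namespace, not part of rNews

def source_triples(story_uri, doi=None, abstract_uri=None,
                   researcher_uri=None, data_uri=None):
    """Build (subject, predicate, object) triples linking a story to its sources.

    Only the sources that are actually supplied produce a triple.
    """
    candidates = [
        # A DOI becomes a URI by prefixing the doi.org resolver address.
        ("sourceDOI", ("https://doi.org/" + doi) if doi else None),
        ("sourceAbstract", abstract_uri),
        ("sourceResearcher", researcher_uri),
        ("sourceData", data_uri),
    ]
    return [(story_uri, EX + name, value) for name, value in candidates if value]

# Usage with placeholder identifiers (not a real paper or dataset).
links = source_triples(
    "http://news.example.org/a1",
    doi="10.1234/placeholder",
    data_uri="http://data.example.org/raw.csv",
)
```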
Again, it would also help if there was a handy source of URIs for scientists.
Should the story be at the heart of the ontology?
I’ve always thought of news stories as metadata about real world events.
If you reframe the problem in this way then what you really want are predicates to describe the relationship of the story (article, photo, video) to the event. You also then want links between people & places and those events (which could be inferred through the various news stories).
Building the ontology this way round would allow for some very powerful analysis and discovery of stories.
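Here is a minimal Python sketch of the kind of inference I have in mind, again using plain tuples. Every predicate name (`reportsOn`, `mentionsPerson`, `mentionsPlace`, `involvedIn`) and every URI is hypothetical; the point is only that once stories point at events, the people and places they mention can be attached to those events too.

```python
from collections import defaultdict

# Hypothetical data: two stories about the same event.
triples = [
    ("story:1", "reportsOn", "event:eruption"),
    ("story:1", "mentionsPerson", "person:A"),
    ("story:2", "reportsOn", "event:eruption"),
    ("story:2", "mentionsPerson", "person:B"),
    ("story:2", "mentionsPlace", "place:X"),
]

def infer_event_links(triples):
    """Link people and places to events via the stories that mention them."""
    events_for_story = defaultdict(set)
    for s, p, o in triples:
        if p == "reportsOn":
            events_for_story[s].add(o)

    inferred = set()
    for s, p, o in triples:
        if p in ("mentionsPerson", "mentionsPlace"):
            for event in events_for_story[s]:
                inferred.add((o, "involvedIn", event))
    return inferred

event_links = infer_event_links(triples)
```

Neither story on its own connects person A, person B and place X, but the event does, which is what makes it a better hub than the story. "Involved in" is of course a generous reading of a mere mention, so a real vocabulary would want something more careful.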
Anyway – I’ll be really interested to see how the ontology develops and how widely it gets adopted.