One of the data analysis methods developed by the team combines string alignment and Natural Language Processing. This method helps archivists predict whether a group of records is organized by similar names, by date, by geographical location, in sequential order, or by a combination of any of those categories. Another analysis method computes paragraph-to-paragraph similarities to automatically discover “stories” from large collections of email messages. These stories may then become the points of entry to large collections that cannot be explored manually.En dat levert dan dus plaatjes op als hierboven en hieronder, waarin het digitaal archief van de National Park Service is weergegeven.
Using visualization and data analysis methods, archivists can apply the appropriate filters to allow them to “see” the collection in a number of ways. The patterns that emerge let the archivist make decisions about the collection as a whole.
[This] image shows the types of files in a collection of NPS Web pages. Purple represents image files; green represents Web files; red represents PDFs; and black represents unknown file formats. |
Gerelateerd
Infopocalypse, kent u dat?
Wat is de relatie tussen een hamer en een RMA?
lees ook de interessante blog van Gerhard Jan Nauta (DEN) http://www.digitaalallemaal.nl/?p=3286
BeantwoordenVerwijderenDank je Bernadine, dat is inderdaad een interessante aanvulling die ik gemist had.
BeantwoordenVerwijderen