(AGENPARL) - Roma, 27 Giugno 2022(AGENPARL) – lun 27 giugno 2022 You are subscribed to Library of Congress Labs Letter from the Library of Congress.
June 2022
LC LABS LETTER
A Monthly Roundup from the Library of Congress Labs Team
Data & Libraries
This month’s issue is devoted to updates on LC Labs’ latest collaborations in this exciting space.
Providing access to collections “as data”
Specifically, the Jupyter Notebook runs the code for how to:
– Put the data into a dataframe, akin to a spreadsheet, to help make the data more comprehensible and easier to manipulate and analyze.
– Limit the scope of the data if you don’t have the computing power to do an analysis on the entire dataset (for example, looking at data for just one year or limiting it to a specific number of rows).
– In addition to these preliminary steps to prepare the data, it demonstrates potential avenues for analysis—by mimetype (media type) and by the most commonly occurring words in the text within the CDX files.
Exploring data in the Library’s collections as primary sources