Institutional Books: A 242B token dataset from Harvard Library's collections

(arxiv.org)

76 points | by strangecasts a day ago ago

27 comments