<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Openpyxl on Digital Archive Systems Tech Blog</title><link>https://tech.ldas.jp/en/tags/openpyxl/</link><description>Recent content in Openpyxl on Digital Archive Systems Tech Blog</description><generator>Hugo</generator><language>en</language><lastBuildDate>Tue, 05 May 2026 08:00:00 +0900</lastBuildDate><atom:link href="https://tech.ldas.jp/en/tags/openpyxl/index.xml" rel="self" type="application/rss+xml"/><item><title>Probing the Public APIs of the Tohoku University Digital Archives — Exporting per-setSpec Excel via OAI-PMH</title><link>https://tech.ldas.jp/en/posts/tohoku-digital-archives-oai-pmh/</link><pubDate>Tue, 05 May 2026 08:00:00 +0900</pubDate><guid>https://tech.ldas.jp/en/posts/tohoku-digital-archives-oai-pmh/</guid><description>&lt;blockquote>
&lt;p>This article is co-authored with generative AI. While I have cross-checked facts against official documentation where possible, errors may remain. Please verify primary sources before making important decisions.&lt;/p>&lt;/blockquote>
&lt;p>While poking around the Derge Tibetan Tripitaka database hosted on the Tohoku University Digital Archives (&lt;a href="https://touda.tohoku.ac.jp/collection/">touda.tohoku.ac.jp/collection&lt;/a>), I wondered whether there was any path that returned JSON, and ended up checking the available public APIs one by one. In the end OAI-PMH turned out to be the workable route, so this post records the procedure for harvesting per-setSpec into Excel files. The whole approach avoids HTML scraping.&lt;/p></description></item></channel></rss>