<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Reprexbase | Daniel Antal</title><link>https://danielantal.eu/tag/reprexbase/</link><atom:link href="https://danielantal.eu/tag/reprexbase/index.xml" rel="self" type="application/rss+xml"/><description>Reprexbase</description><generator>Wowchemy (https://wowchemy.com)</generator><language>en-us</language><lastBuildDate>Thu, 19 Jun 2025 18:45:00 +0200</lastBuildDate><image><url>https://danielantal.eu/media/icon_hub9491570ac57158c0eeecc95c95b13e5_20247_512x512_fill_lanczos_center_3.png</url><title>Reprexbase</title><link>https://danielantal.eu/tag/reprexbase/</link></image><item><title>Help Us Build a Truly Inclusive European Music Observatory</title><link>https://danielantal.eu/post/2025-07-05-iaml-2025/</link><pubDate>Thu, 19 Jun 2025 18:45:00 +0200</pubDate><guid>https://danielantal.eu/post/2025-07-05-iaml-2025/</guid><description>&lt;p>Across Europe, music libraries are under pressure: greater expectations for
digital services, growing metadata burdens, and increasingly fragmented infrastructure.
At the same time, vital parts of our musical heritage—especially regional or
minority repertoires—remain hidden from search, discovery, and policy.&lt;/p>
&lt;div class="alert alert-note">
&lt;div>
Please meet us at 👉 &lt;a href="https://danielantal.eu/event/2025-07-07-iaml2025/">IAML2025&lt;/a> in Salzburg on 7 or 8th July. Our presentation takes place in the session of
&lt;strong>Music Libraries of Tomorrow: Reaching out to Wider Audiences&lt;/strong> at the
Mozarteum University E.001 HS Thomas Bernhard room on 7 July 2025, 16:00–17:30.
The day after you can meet us in the Gallery for the poster session.
&lt;/div>
&lt;/div>
&lt;p>We initiated the Open Music Europe project, because we believe that in the music
ecosystem, data centralisation always fails, and a new kind of cooperation
is needed—one that respects local control while enabling international reuse.&lt;/p>
&lt;p>Our Slovak pilot, the &lt;a href="https://reprex.nl/project/skcmdb/" target="_blank" rel="noopener">SKCMDb&lt;/a>, connects libraries, music centres, rights organisations,
and platforms through a shared metadata backbone based on open ontologies.
Built as a national data sharing space, it enables coordinated cataloguing and
discovery across public and private systems—from streaming services and
printed scores to CD loans and digital archives.&lt;/p>
&lt;figure id="figure-please-visit-our-poster-and-talk-with-our-team-members-daniel-antal-anna-márta-mester-librarian-data-steward-and-anna-zilkova-chairperson-of-iaml-slovakia-on-8-july-2025-10301100-in-the-gallery-you-can-download-our-poster-in-pdf-herehttpszenodoorgrecords15814286">
&lt;div class="d-flex justify-content-center">
&lt;div class="w-100" >&lt;img alt="Please visit our poster and talk with our team members, Daniel Antal, Anna Márta Mester (librarian-data steward) and Anna Zilkova (chairperson of IAML Slovakia) on 8 July 2025 10:30–11:00 in the Gallery. You can download our poster in PDF [here](https://zenodo.org/records/15814286)." srcset="
/media/posters/IAML-reprex-poster-2025_hu29650834c1f466e3d24bbd103225d8fb_3179691_01169f53faf069cb2891f0b0e3cd648a.webp 400w,
/media/posters/IAML-reprex-poster-2025_hu29650834c1f466e3d24bbd103225d8fb_3179691_1e6b7c9ba9c6aa5d98bc5e96b49da985.webp 760w,
/media/posters/IAML-reprex-poster-2025_hu29650834c1f466e3d24bbd103225d8fb_3179691_1200x1200_fit_q75_h2_lanczos_3.webp 1200w"
src="https://danielantal.eu/media/posters/IAML-reprex-poster-2025_hu29650834c1f466e3d24bbd103225d8fb_3179691_01169f53faf069cb2891f0b0e3cd648a.webp"
width="538"
height="760"
loading="lazy" data-zoomable />&lt;/div>
&lt;/div>&lt;figcaption>
Please visit our poster and talk with our team members, Daniel Antal, Anna Márta Mester (librarian-data steward) and Anna Zilkova (chairperson of IAML Slovakia) on 8 July 2025 10:30–11:00 in the Gallery. You can download our poster in PDF &lt;a href="https://zenodo.org/records/15814286" target="_blank" rel="noopener">here&lt;/a>.
&lt;/figcaption>&lt;/figure>
&lt;p>But we also know that cultural and music policy is not only national.
It is often regional, local, or community-based. That’s why we follow the principle
of subsidiarity: letting decisions and innovation happen at the lowest competent
level, close to the collections and communities themselves.&lt;/p>
&lt;p>Our &lt;a href="https://reprex.nl/project/finnougricdataspace/" target="_blank" rel="noopener">Finno-Ugric Data Sharing Space&lt;/a>, including the LīvMDb (Livonian Music Database),
shows how even the smallest communities—without formal cultural
infrastructure—can take part in high-quality metadata production and digital
discovery. We provide the tools and models to empower local custodians,
in their language, on their terms, and without the need for large institutional support.&lt;/p>
&lt;td style="text-align: center;">
&lt;figure id="figure-please-check-out-the-demo-version-of-the-finno-ugric-dataspacehttpsreprexbaseeufuindexphptitlemain_page-or-read-the-long-form-project-descriptionhttpsreprexnldocumentsfufu">
&lt;div class="d-flex justify-content-center">
&lt;div class="w-100" >&lt;img alt="Please check out the demo version of the [Finno-Ugric Dataspace](https://reprexbase.eu/fu/index.php?title=Main_Page) or read the long-form [project description](https://reprex.nl/documents/fu/fu)." srcset="
/media/png/dataspace/finnougric/Finno-Ugric-Sampo-20250705_16x9_hudbcd3518b03f17b8a68d3531b004eb1f_965309_de77c0577d5b27f353d35e18a5ecd92d.webp 400w,
/media/png/dataspace/finnougric/Finno-Ugric-Sampo-20250705_16x9_hudbcd3518b03f17b8a68d3531b004eb1f_965309_137c852e1e9d796114cc57a570eda16f.webp 760w,
/media/png/dataspace/finnougric/Finno-Ugric-Sampo-20250705_16x9_hudbcd3518b03f17b8a68d3531b004eb1f_965309_1200x1200_fit_q75_h2_lanczos_3.webp 1200w"
src="https://danielantal.eu/media/png/dataspace/finnougric/Finno-Ugric-Sampo-20250705_16x9_hudbcd3518b03f17b8a68d3531b004eb1f_965309_de77c0577d5b27f353d35e18a5ecd92d.webp"
width="760"
height="428"
loading="lazy" data-zoomable />&lt;/div>
&lt;/div>&lt;figcaption>
Please check out the demo version of the &lt;a href="https://reprexbase.eu/fu/index.php?title=Main_Page" target="_blank" rel="noopener">Finno-Ugric Dataspace&lt;/a> or read the long-form &lt;a href="https://reprex.nl/documents/fu/fu" target="_blank" rel="noopener">project description&lt;/a>.
&lt;/figcaption>&lt;/figure>&lt;/td>
&lt;p>Now we invite IAML members—national libraries, regional centres, municipal collections,
and independent music librarians—to join us in building a federated,
decentralised European Music Observatory. One that reflects Europe’s diversity.
One that reduces data curation costs and improves visibility.
One that connects music libraries with the open data and open science infrastructures
already transforming other sectors.&lt;/p>
&lt;p>Our platform is open-source, built on FAIR principles and the
European Interoperability Framework. We use tools like Wikibase, Blazegraph,
Sampo-UI, and R—packaged to work for libraries with limited technical capacity.&lt;/p>
&lt;td style="text-align: center;">
&lt;figure id="figure-sneak-peak-http13518191513007enhttp13518191513007en">
&lt;div class="d-flex justify-content-center">
&lt;div class="w-100" >&lt;img alt="Sneak peak: [http://135.181.91.51:3007/en/](http://135.181.91.51:3007/en/)" srcset="
/media/png/skcmdb/skcmdb-library-access_huf3155c55aaf98b7cdd63ae10eaa747e0_109899_44f93947517fcf22ddbe6ef033eed7cf.webp 400w,
/media/png/skcmdb/skcmdb-library-access_huf3155c55aaf98b7cdd63ae10eaa747e0_109899_2cb506361c74ce14745f98b518ae5453.webp 760w,
/media/png/skcmdb/skcmdb-library-access_huf3155c55aaf98b7cdd63ae10eaa747e0_109899_1200x1200_fit_q75_h2_lanczos_3.webp 1200w"
src="https://danielantal.eu/media/png/skcmdb/skcmdb-library-access_huf3155c55aaf98b7cdd63ae10eaa747e0_109899_44f93947517fcf22ddbe6ef033eed7cf.webp"
width="760"
height="723"
loading="lazy" data-zoomable />&lt;/div>
&lt;/div>&lt;figcaption>
Sneak peak: &lt;a href="http://135.181.91.51:3007/en/" target="_blank" rel="noopener">http://135.181.91.51:3007/en/&lt;/a>
&lt;/figcaption>&lt;/figure>
&lt;/td>
&lt;p>If you care about interoperability, cultural equity, and the future of
library relevance in the streaming era—this is your moment to get involved.&lt;/p>
&lt;p>Not present at &lt;code>IAML2025&lt;/code>?&lt;/br>
👉 &lt;a href="https://danielantal.eu/slides/20250707-reprex-iaml2025/">Presentation&lt;/a>&lt;/br>
👉 &lt;a href="https://zenodo.org/records/15814286" target="_blank" rel="noopener">Poster&lt;/a>&lt;/br>
👉 Please &lt;a href="https://reprex.nl/contact/" target="_blank" rel="noopener">contact us&lt;/a> directly.&lt;/p>
&lt;p>Let’s ensure music libraries remain vital entry points to Europe’s rich and evolving cultural soundscape.&lt;/p></description></item><item><title>Metadata Groundhog Day: What a Moribound Language Can Teach Spotify and Shopify</title><link>https://danielantal.eu/post/2025-06-19-gazetteer/</link><pubDate>Thu, 19 Jun 2025 18:45:00 +0200</pubDate><guid>https://danielantal.eu/post/2025-06-19-gazetteer/</guid><description>&lt;p>And if you want to fix these errors, you may find that you are back to the &lt;strong>Data Sisyphus&lt;/strong>.&lt;/p>
&lt;p>When you build systems in the cloud, or in your local architecture, at one point you will realise that naming things — places, people, products — or updating their whereabouts is probably the most time-consuming, most expensive, and most error-prone workflow.&lt;/p>
&lt;p>In this blogpost, we want to talk about what seems like the easiest part of a location: the name of the city, town, or village.&lt;/p>
&lt;h2 id="mazirbe-is-missing-again">Mazirbe Is Missing Again&lt;/h2>
&lt;p>We recently built a multilingual gazetteer — essentially a reconciled database of place names — for a tiny stretch of the Livonian coast in Latvia. At first glance, this might seem like a project rooted deeply in the digital humanities.&lt;/p>
&lt;p>But here’s the twist: the very same problems we tackled here are the ones plaguing the music industry, global e-commerce platforms, and enterprise software stacks.&lt;/p>
&lt;td style="text-align: center;">
&lt;figure id="figure-this-is-gross-irben--lielirbe--īra--irben--suur-irbenhttpsreprexbaseeufuitemq4429--familiar-with-rdf-see-in-ttlhttpsreprexbaseeufuspecialentitydataq4429ttl-klein-irben-will-be-near-gross-irben-and-irē-is-almost-īra">
&lt;div class="d-flex justify-content-center">
&lt;div class="w-100" >&lt;img alt="This is Gross-Irben 👉 [Lielirbe / Īra / Irben / Suur-Irben](https://reprexbase.eu/fu/Item:Q4429) 👉 [Familiar with RDF: see in TTL](https://reprexbase.eu/fu/Special:EntityData/Q4429.ttl); Klein-Irben will be near Gross-Irben, and Irē is almost Īra!" srcset="
/media/png/identifiers/geonames_lielirbe_2x1_hu2f3d53746350179bbf98c0f697c64400_111140_32eeb639bc704a3ad38976cf6b4a1e06.webp 400w,
/media/png/identifiers/geonames_lielirbe_2x1_hu2f3d53746350179bbf98c0f697c64400_111140_df330965d5c718527e1c1756ff9f6b0c.webp 760w,
/media/png/identifiers/geonames_lielirbe_2x1_hu2f3d53746350179bbf98c0f697c64400_111140_1200x1200_fit_q75_h2_lanczos_3.webp 1200w"
src="https://danielantal.eu/media/png/identifiers/geonames_lielirbe_2x1_hu2f3d53746350179bbf98c0f697c64400_111140_32eeb639bc704a3ad38976cf6b4a1e06.webp"
width="760"
height="380"
loading="lazy" data-zoomable />&lt;/div>
&lt;/div>&lt;figcaption>
This is Gross-Irben 👉 &lt;a href="https://reprexbase.eu/fu/Item:Q4429" target="_blank" rel="noopener">Lielirbe / Īra / Irben / Suur-Irben&lt;/a> 👉 &lt;a href="https://reprexbase.eu/fu/Special:EntityData/Q4429.ttl" target="_blank" rel="noopener">Familiar with RDF: see in TTL&lt;/a>; Klein-Irben will be near Gross-Irben, and Irē is almost Īra!
&lt;/figcaption>&lt;/figure>
&lt;/td>
&lt;p>Mazirbe is a small, big place. It definitely exists, and it is the cultural center of a small nation: the Livonians. Yet, when you are looking for clothing, music, or photographs that should come from Mazirbe in a relevant database, you often find nothing. Not even the place.&lt;/p>
&lt;div class="alert alert-note">
&lt;div>
&lt;h4 id="but-mazirbe-exists">But Mazirbe exists!&lt;/h4>
&lt;p>Depending on the record, it might appear as:&lt;/p>
&lt;ul>
&lt;li>Mazirbe (Latvian) • Irē (Livonian) • Мазирбе (Russian) • Klein-Irben (German) • Suur-Irben (Finnish-German hybrid) •
Мазирбе (Russian) • Mazirbė (Lithuanian)&lt;/li>
&lt;/ul>
&lt;td style="text-align: center;">
&lt;figure id="figure-meyers-zeitungsatlas-050--russland--gouvernement-sankt-petersburg-esthland-liefland-kurlandhttpsuploadwikimediaorgwikipediacommons004meyere28098s_zeitungsatlas_050_e28093_russland-_gouvernement_sankt_petersburg2c_esthland2c_liefland2c_kurlandjpg">
&lt;div class="d-flex justify-content-center">
&lt;div class="w-100" >&lt;img alt="[Meyer‘s Zeitungsatlas 050 – Russland- Gouvernement Sankt Petersburg, Esthland, Liefland, Kurland](https://upload.wikimedia.org/wikipedia/commons/0/04/Meyer%E2%80%98s_Zeitungsatlas_050_%E2%80%93_Russland-_Gouvernement_Sankt_Petersburg%2C_Esthland%2C_Liefland%2C_Kurland.jpg)" srcset="
/media/webp/identifiers/old_map_of_courland_hue01bb2cd2f71c02c57a3f3e8212ca966_977008_0c287770dd041e1a97efbed60791a980.webp 400w,
/media/webp/identifiers/old_map_of_courland_hue01bb2cd2f71c02c57a3f3e8212ca966_977008_9d4478bc71dd6069b90b91b75e983c5e.webp 760w,
/media/webp/identifiers/old_map_of_courland_hue01bb2cd2f71c02c57a3f3e8212ca966_977008_1200x1200_fit_q75_h2_lanczos_2.webp 1200w"
src="https://danielantal.eu/media/webp/identifiers/old_map_of_courland_hue01bb2cd2f71c02c57a3f3e8212ca966_977008_0c287770dd041e1a97efbed60791a980.webp"
width="760"
height="522"
loading="lazy" data-zoomable />&lt;/div>
&lt;/div>&lt;figcaption>
&lt;a href="https://upload.wikimedia.org/wikipedia/commons/0/04/Meyer%E2%80%98s_Zeitungsatlas_050_%E2%80%93_Russland-_Gouvernement_Sankt_Petersburg%2C_Esthland%2C_Liefland%2C_Kurland.jpg" target="_blank" rel="noopener">Meyer‘s Zeitungsatlas 050 – Russland- Gouvernement Sankt Petersburg, Esthland, Liefland, Kurland&lt;/a>
&lt;/figcaption>&lt;/figure>
&lt;/td>
&lt;/div>
&lt;/div>
&lt;p>This kind of variation isn’t just a cultural footnote — it breaks databases, mismatches search results, and silently corrupts analytics.&lt;/p>
&lt;p>If you&amp;rsquo;re in music metadata, this is your &lt;strong>&amp;ldquo;JAY Z&amp;rdquo; vs. &amp;ldquo;Jay-Z&amp;rdquo; vs. &amp;ldquo;Shawn Carter&amp;rdquo;&lt;/strong> problem.&lt;/p>
&lt;p>If you&amp;rsquo;re in e-commerce, it’s &lt;strong>“Red Crewneck XXL” vs. “Crewneck, crimson, 2XL”&lt;/strong>.&lt;/p>
&lt;p>Same data structure. Same unresolved chaos.&lt;/p>
&lt;h2 id="a-gazetteer-that-works-like-real-life">A Gazetteer That Works Like Real Life&lt;/h2>
&lt;p>We created a semantic, multilingual, multiscript gazetteer for the Livonian coast. Each place entry includes:&lt;/p>
&lt;ul>
&lt;li>
&lt;p>All known name variants across time, languages, and scripts&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Structured links to global authority services (Wikidata, VIAF, GeoNames)&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Canonical IDs, multilingual labels, and machine-readable formats (RDF, TTL, etc.)&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Context about administrative boundaries, historical changes, and source provenance&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>Try us:&lt;/p>
&lt;td style="text-align: center;">
&lt;figure id="figure--mazirbe--irē--klein-irben--мазирбе--mazirbėhttpsreprexbaseeufuitemq4202--familiar-with-rdf-see-in-ttlhttpsreprexbaseeufuspecialentitydataq4202ttl">
&lt;div class="d-flex justify-content-center">
&lt;div class="w-100" >&lt;img alt="👉 [Mazirbe / Irē / Klein-Irben / Мазирбе / Mazirbė](https://reprexbase.eu/fu/Item:Q4202) 👉 [Familiar with RDF: see in TTL](https://reprexbase.eu/fu/Special:EntityData/Q4202.ttl)" srcset="
/media/png/identifiers/fuds_mazirbe_2x1_hue147fb84f0dcec7ccc29241d53d4804a_93374_a8f87b64beac38f2fa0357c52941c885.webp 400w,
/media/png/identifiers/fuds_mazirbe_2x1_hue147fb84f0dcec7ccc29241d53d4804a_93374_806e2facbbacdd3d0e7b238e4f7b504b.webp 760w,
/media/png/identifiers/fuds_mazirbe_2x1_hue147fb84f0dcec7ccc29241d53d4804a_93374_1200x1200_fit_q75_h2_lanczos_3.webp 1200w"
src="https://danielantal.eu/media/png/identifiers/fuds_mazirbe_2x1_hue147fb84f0dcec7ccc29241d53d4804a_93374_a8f87b64beac38f2fa0357c52941c885.webp"
width="760"
height="380"
loading="lazy" data-zoomable />&lt;/div>
&lt;/div>&lt;figcaption>
👉 &lt;a href="https://reprexbase.eu/fu/Item:Q4202" target="_blank" rel="noopener">Mazirbe / Irē / Klein-Irben / Мазирбе / Mazirbė&lt;/a> 👉 &lt;a href="https://reprexbase.eu/fu/Special:EntityData/Q4202.ttl" target="_blank" rel="noopener">Familiar with RDF: see in TTL&lt;/a>
&lt;/figcaption>&lt;/figure>
&lt;/td>
&lt;p>We published it using &lt;strong>Wikibase&lt;/strong> — the same technology that powers Wikidata. It&amp;rsquo;s not just a spreadsheet; it&amp;rsquo;s a small, dynamic knowledge graph.&lt;/p>
&lt;p>And we also put it into &lt;strong>BlazeGraph&lt;/strong>, so you can find all these villages — and also the music, the clothing, or photographs that come from them.&lt;/p>
&lt;h2 id="so-what">So What?&lt;/h2>
&lt;p>Here’s why this matters outside the northern shores of Kurzeme, or beyond the borders of Latvia:&lt;/p>
&lt;ul>
&lt;li>
&lt;p>In global &lt;strong>supply chains&lt;/strong>, location names and vendor names drift constantly. While country boundaries are relatively stable, subnational boundary changes — counties, parishes, provinces, municipal borders — happen &lt;strong>thousands of times per year&lt;/strong>, even within Europe.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>In &lt;strong>streaming metadata&lt;/strong>, artists get duplicated, misspelled, or transliterated inconsistently. It’s not unusual to find &lt;strong>dozens of same-named artists&lt;/strong> in a distributor’s or rights manager’s roster.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>In &lt;strong>CRM systems&lt;/strong>, customers have multiple entries because of one diacritic. &lt;em>Irē&lt;/em> becomes &lt;em>Ire&lt;/em> if the user didn’t have &lt;code>ē&lt;/code> installed.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>In &lt;strong>museum heritage databases&lt;/strong> and &lt;strong>webshops&lt;/strong>, items disappear because their place of origin changed names three times since the accession record was created.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>Our little example was created to accompany a digital humanities publication, but it&amp;rsquo;s &lt;strong>not just a “humanities” problem&lt;/strong>. It’s a &lt;strong>cross-sector, multilingual, historical, bureaucratic, data problem&lt;/strong>.&lt;/p>
&lt;p>And we’re all living in it.&lt;/p>
&lt;h2 id="lessons-we-took-away">Lessons We Took Away&lt;/h2>
&lt;ul>
&lt;li>
&lt;p>Don’t fight ambiguity. &lt;strong>Model it.&lt;/strong>&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Linked data models (RDF, Wikibase) handle aliases and variants with elegance.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Small, local, curated vocabularies can scale conceptually to global systems.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Top-down standardization fails in diverse data ecosystems — &lt;strong>context wins&lt;/strong>.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;h2 id="see-it--fork-it--repurpose-it">See It / Fork It / Repurpose It&lt;/h2>
&lt;p>You can explore the full Livonian Gazetteer here:&lt;/p>
&lt;ul>
&lt;li>
&lt;p>Web UI: &lt;a href="https://reprexbase.eu/fu/Main_Page" target="_blank" rel="noopener">https://reprexbase.eu/fu/Main_Page&lt;/a>&lt;/p>
&lt;/li>
&lt;li>
&lt;p>RDF example: &lt;a href="https://reprexbase.eu/fu/Special:EntityData/Q4429.ttl" target="_blank" rel="noopener">https://reprexbase.eu/fu/Special:EntityData/Q4429.ttl&lt;/a>&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Also check out our &lt;a href="https://reprexbase.eu/textilebase/" target="_blank" rel="noopener">TextileBase&lt;/a> project — same model, but for 19th-century Latvian shirts and skirts&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>If your stack includes &lt;strong>messy location names, user-generated labels, non-English content, or legacy records&lt;/strong> — maybe this can help.&lt;/p>
&lt;p>And if you feel like you’ve seen this movie before… you have.&lt;/p>
&lt;p>It’s &lt;strong>Data Sisyphus&lt;/strong> all over again.&lt;/p>
&lt;p>👉 &lt;a href="https://reprex.nl/post/2021-07-08-data-sisyphus/" target="_blank" rel="noopener">https://reprex.nl/post/2021-07-08-data-sisyphus/&lt;/a>&lt;/p></description></item><item><title>Linked Open Datasets on Garments from the Latgale Region</title><link>https://danielantal.eu/post/2025-04-07_latgalean_dataset/</link><pubDate>Mon, 07 Apr 2025 17:00:00 +0100</pubDate><guid>https://danielantal.eu/post/2025-04-07_latgalean_dataset/</guid><description>&lt;td style="text-align: center;">
&lt;figure id="figure-entry-examples-from-linked-open-datasets-on-garments-from-the-latgale-region-from-right-to-left-q142httpsreprexbaseeutextilebaseindexphptitleitemq142-q180httpsreprexbaseeutextilebaseindexphptitleitemq180-q179httpsreprexbaseeutextilebaseindexphptitleitemq179-q181httpsreprexbaseeutextilebaseindexphptitleitemq181">
&lt;div class="d-flex justify-content-center">
&lt;div class="w-100" >&lt;img alt="Entry examples from `Linked Open Datasets on Garments from the Latgale Region`, from right to left: [Q142](https://reprexbase.eu/textilebase/index.php?title=Item:Q142), [Q180](https://reprexbase.eu/textilebase/index.php?title=Item:Q180), [Q179](https://reprexbase.eu/textilebase/index.php?title=Item:Q179), [Q181](https://reprexbase.eu/textilebase/index.php?title=Item:Q181)." srcset="
/media/png/dataspace/textilebase/Textilebase_four_images_hud9c1ab3a1a107b842d90f84f6576b635_244654_c7d8e4451d5ce9ff2a6b3c6620e7010a.webp 400w,
/media/png/dataspace/textilebase/Textilebase_four_images_hud9c1ab3a1a107b842d90f84f6576b635_244654_e05a6496c265420cb806ca4974eaa553.webp 760w,
/media/png/dataspace/textilebase/Textilebase_four_images_hud9c1ab3a1a107b842d90f84f6576b635_244654_1200x1200_fit_q75_h2_lanczos_3.webp 1200w"
src="https://danielantal.eu/media/png/dataspace/textilebase/Textilebase_four_images_hud9c1ab3a1a107b842d90f84f6576b635_244654_c7d8e4451d5ce9ff2a6b3c6620e7010a.webp"
width="760"
height="380"
loading="lazy" data-zoomable />&lt;/div>
&lt;/div>&lt;figcaption>
Entry examples from &lt;code>Linked Open Datasets on Garments from the Latgale Region&lt;/code>, from right to left: &lt;a href="https://reprexbase.eu/textilebase/index.php?title=Item:Q142" target="_blank" rel="noopener">Q142&lt;/a>, &lt;a href="https://reprexbase.eu/textilebase/index.php?title=Item:Q180" target="_blank" rel="noopener">Q180&lt;/a>, &lt;a href="https://reprexbase.eu/textilebase/index.php?title=Item:Q179" target="_blank" rel="noopener">Q179&lt;/a>, &lt;a href="https://reprexbase.eu/textilebase/index.php?title=Item:Q181" target="_blank" rel="noopener">Q181&lt;/a>.
&lt;/figcaption>&lt;/figure>
&lt;/td>
&lt;p>The first published dataset, &lt;a href="https://reprexbase.eu/textilebase/index.php?title=Linked_Open_Datasets_on_Garments_from_the_Latgale_Region" title="Linked Open Datasets on Garments from the Latgale Region" target="_blank" rel="noopener">Linked Open Datasets on Garments from the Latgale Region&lt;/a> contains data on Latvian traditional shirts and skirts from the Latgale region in Eastern Latvia. The The &lt;a href="https://reprexbase.eu/textilebase/index.php?title=Item:Q232" title="Item:Q232" target="_blank" rel="noopener">female&lt;/a> and &lt;a href="https://reprexbase.eu/textilebase/index.php?title=Item:Q233" title="Item:Q233" target="_blank" rel="noopener">male shirts&lt;/a>, and the &lt;a href="https://reprexbase.eu/textilebase/index.php?title=Item:Q234" title="Item:Q234" target="_blank" rel="noopener">skirts&lt;/a> in the dataset are handmade and were worn in the 19th century. They represent both festive and daily wear of the local female and male peasants. The shirts are stored at the &lt;a href="https://reprexbase.eu/textilebase/index.php?title=National_History_Museum_of_Latvia" title="National History Museum of Latvia" target="_blank" rel="noopener">National History Museum of Latvia&lt;/a> and the &lt;a href="https://reprexbase.eu/textilebase/index.php?title=Ethnographic_Open-Air_Museum_of_Latvia" title="Ethnographic Open-Air Museum of Latvia" target="_blank" rel="noopener">Ethnographic Open-Air Museum of Latvia&lt;/a>. The data contain information on the locality of their origin, their approximate date of creation with various precisions, the materials they are made of, and the way of their fabrication, as well as their purpose of wearing (festive or daily wear) and wearer’s ethnicity and gender. They also include the name of the museum each shirt is stored at, supplemented with its unique inventory number. Data on some sample shirts also include a photo of the shirt.&lt;/p>
&lt;ul>
&lt;li>
&lt;p>Check out the properties (relations) in the data model: &lt;a href="https://reprexbase.eu/textilebase/index.php?title=Special:ListProperties" title="Special:ListProperties" target="_blank" rel="noopener">ListProperties&lt;/a>&lt;/p>
&lt;/li>
&lt;li>
&lt;p>See every entry in the database: &lt;a href="https://reprexbase.eu/textilebase/index.php?title=Special:AllPages&amp;amp;from=&amp;amp;to=&amp;amp;namespace=120" target="_blank" rel="noopener">All items&lt;/a>&lt;/p>
&lt;/li>
&lt;/ul></description></item><item><title>Dataweek²⁴: Data-driven Compliance with the Corporate Social Responsibility Directive</title><link>https://danielantal.eu/post/2024-06-06_dataweek_csrd_compliance/</link><pubDate>Thu, 06 Jun 2024 16:22:00 +0100</pubDate><guid>https://danielantal.eu/post/2024-06-06_dataweek_csrd_compliance/</guid><description>&lt;td style="text-align: center;">
&lt;figure id="figure-all-slidesslides20240605_d_antal_csrd_automated_compliance">
&lt;div class="d-flex justify-content-center">
&lt;div class="w-100" >&lt;img alt="[All slides](/slides/20240605_d_antal_csrd_automated_compliance/)" srcset="
/media/slides/20240605_D_Antal_CSRD_Automated_Compliance/20240606_D_Antal_Dataweek_CSRD_Compliance_2_huff2b9e2a5100d3b1a1ff280d3fcfd4e1_440501_3bd0c6cf7e1fbc7522302ca816b1aed3.webp 400w,
/media/slides/20240605_D_Antal_CSRD_Automated_Compliance/20240606_D_Antal_Dataweek_CSRD_Compliance_2_huff2b9e2a5100d3b1a1ff280d3fcfd4e1_440501_1d681a0f5aa131275b21bbd50ffeba21.webp 760w,
/media/slides/20240605_D_Antal_CSRD_Automated_Compliance/20240606_D_Antal_Dataweek_CSRD_Compliance_2_huff2b9e2a5100d3b1a1ff280d3fcfd4e1_440501_1200x1200_fit_q75_h2_lanczos_3.webp 1200w"
src="https://danielantal.eu/media/slides/20240605_D_Antal_CSRD_Automated_Compliance/20240606_D_Antal_Dataweek_CSRD_Compliance_2_huff2b9e2a5100d3b1a1ff280d3fcfd4e1_440501_3bd0c6cf7e1fbc7522302ca816b1aed3.webp"
width="760"
height="428"
loading="lazy" data-zoomable />&lt;/div>
&lt;/div>&lt;figcaption>
&lt;a href="https://danielantal.eu/slides/20240605_d_antal_csrd_automated_compliance/">All slides&lt;/a>
&lt;/figcaption>&lt;/figure>&lt;/td>
&lt;p>&lt;em>This is a lightly edited version of the presentation at the Data-driven and Automated Compliance section of Dataweek²⁴&lt;/em> &lt;a href="https://danielantal.eu/event/2024-06-05_dataweek_leuven/">Event page&lt;/a>.&lt;/p>
&lt;p>It has often been said that ESG reporting is a data problem. If you want to fulfil the new requirements set by the Corporate Social Responsibility Directive, which is a legal act that changes European laws on financial accounting and its audit or assurance, you will encounter a very serious data linking and integration problem. If you have such sustainability bookkeeping, you must be able to factually link your financial accounts to your environmental and social accounts. In simple terms, if you expense the cost of 1 MW of electricity, then you cannot calculate the footprint of 0.98 MW in your sustainability report.&lt;/p>
&lt;td style="text-align: center;">
&lt;figure id="figure-all-slidesslides20240605_d_antal_csrd_automated_compliance">
&lt;div class="d-flex justify-content-center">
&lt;div class="w-100" >&lt;img alt="[All slides](/slides/20240605_d_antal_csrd_automated_compliance/)" srcset="
/media/slides/20240605_D_Antal_CSRD_Automated_Compliance/20240605_D_Antal_CSRD_Automated_Compliance_3_hu35497785547ade41c544da690fc87aea_70818_93dc9414a0f2bc07ef71feb56b723839.webp 400w,
/media/slides/20240605_D_Antal_CSRD_Automated_Compliance/20240605_D_Antal_CSRD_Automated_Compliance_3_hu35497785547ade41c544da690fc87aea_70818_11005af9d4188a15f0edb76204fc2185.webp 760w,
/media/slides/20240605_D_Antal_CSRD_Automated_Compliance/20240605_D_Antal_CSRD_Automated_Compliance_3_hu35497785547ade41c544da690fc87aea_70818_1200x1200_fit_q75_h2_lanczos_3.webp 1200w"
src="https://danielantal.eu/media/slides/20240605_D_Antal_CSRD_Automated_Compliance/20240605_D_Antal_CSRD_Automated_Compliance_3_hu35497785547ade41c544da690fc87aea_70818_93dc9414a0f2bc07ef71feb56b723839.webp"
width="760"
height="428"
loading="lazy" data-zoomable />&lt;/div>
&lt;/div>&lt;figcaption>
&lt;a href="https://danielantal.eu/slides/20240605_d_antal_csrd_automated_compliance/">All slides&lt;/a>
&lt;/figcaption>&lt;/figure>&lt;/td>
&lt;p>ESG introduces two related challenges, which are best addressed by explicit knowledge bases, enterprise graphs connected to open knowledge graphs, and data sharing spaces.&lt;/p>
&lt;ul>
&lt;li>
&lt;p>&lt;input checked="" disabled="" type="checkbox"> The company must be able to collect data and report on other companies and things that happen outside of the boundaries of the company or company group. Such a practice had been in place in some heavily regulated industries; for example, the food industry had to bear a qualified responsibility over the entire food supply chain or the nuclear industry for the lifecycle of the fossile fuels.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;input checked="" disabled="" type="checkbox"> The company must also be able to curate trustworthy data about entirely new domains: data about the natural or ecological environment, such as data on biodiversity, recycling, water; and data about the social environment, for example, indicators about affected communities, workers in the supply chain or end users.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>Joining a data sharing space is a good solution because the new data requirements are not one-time data upgrades but require a permanent data connection with the ecological and social environment. A company and its ERP system or its key performance management users cannot import new data, such as metadata of the EU Taxonomy regulation, and call it a day. Data curation must be an ongoing activity.&lt;/p>
&lt;td style="text-align: center;">
&lt;figure id="figure-all-slidesslides20240605_d_antal_csrd_automated_compliance">
&lt;div class="d-flex justify-content-center">
&lt;div class="w-100" >&lt;img alt="[All slides](/slides/20240605_d_antal_csrd_automated_compliance/)" srcset="
/media/slides/20240605_D_Antal_CSRD_Automated_Compliance/20240605_D_Antal_CSRD_Automated_Compliance_7_hu4629b999ec15561e6f79fc1523681153_124114_c5d150ce27e861f751950e3e18964193.webp 400w,
/media/slides/20240605_D_Antal_CSRD_Automated_Compliance/20240605_D_Antal_CSRD_Automated_Compliance_7_hu4629b999ec15561e6f79fc1523681153_124114_fccbf7977b071b2ba00e7333dbc4870a.webp 760w,
/media/slides/20240605_D_Antal_CSRD_Automated_Compliance/20240605_D_Antal_CSRD_Automated_Compliance_7_hu4629b999ec15561e6f79fc1523681153_124114_1200x1200_fit_q75_h2_lanczos_3.webp 1200w"
src="https://danielantal.eu/media/slides/20240605_D_Antal_CSRD_Automated_Compliance/20240605_D_Antal_CSRD_Automated_Compliance_7_hu4629b999ec15561e6f79fc1523681153_124114_c5d150ce27e861f751950e3e18964193.webp"
width="760"
height="428"
loading="lazy" data-zoomable />&lt;/div>
&lt;/div>&lt;figcaption>
&lt;a href="https://danielantal.eu/slides/20240605_d_antal_csrd_automated_compliance/">All slides&lt;/a>
&lt;/figcaption>&lt;/figure>&lt;/td>
&lt;p>Most companies import relatively little data regularly, and therefore, they have little experience in data curation, i.e., the art of organising, annotating, and integrating data collected from various sources in a way that is presentable in the form of indicators or can be reused later. Perhaps their bookkeeping needs to import foreign exchange rates regularly if they export or import in their activities. Data curation will become an ongoing activity if they start to monitor and measure the environmental and social environment impacts of their actions.&lt;/p>
&lt;p>So, if we agree that joining a data sharing space, i.e., an organisation that has pre-agreed terms and conditions on sharing and exchanging data, with pre-agreed terminology or vocabulary (&amp;ldquo;semantics&amp;rdquo;) and technology, the question is, what kind of data sharing organisation is the most appropriate?&lt;/p>
&lt;p>The CSRD Directive has industry-agnostic elements, which must be fulfilled in every company, and industry-specific elements, which are being developed as we speak. For example, we work mainly with music and film production, both of which belong to the Media and Entertainment Group, which must apply the same industry-specific variations of the European Sustainability Development Standards. We think that the best is to create industry-specific data sharing spaces, such as the famous Data 4.0 for manufacturing, but allow them to be federated and to exploit further synergies: there are plenty of financial, economic, social or environmental data that are used by other existing data sharing spaces and it is not necessary to curate and produce them in, for example, a music or film-production oriented data sharing space.&lt;/p>
&lt;p>Are for-profit and social enterprises technically ready to join data-sharing spaces? Not without help. While some large corporations have explicit knowledge bases and enterprise graphs, most smaller European enterprises do not necessarily have a distinct IT function. It is a simple relational database with a fixed schema if they manage databases. Bringing them on board requires simple systems and assistance.&lt;/p>
&lt;td style="text-align: center;">
&lt;figure id="figure-all-slidesslides20240605_d_antal_csrd_automated_compliance">
&lt;div class="d-flex justify-content-center">
&lt;div class="w-100" >&lt;img alt="[All slides](/slides/20240605_d_antal_csrd_automated_compliance/)" srcset="
/media/slides/20240605_D_Antal_CSRD_Automated_Compliance/20240605_D_Antal_CSRD_Automated_Compliance_6_hu6b783c6b8e4188d05bc95a3713e01010_165927_1d1b9767c6a5bd995bc787e3bca443c0.webp 400w,
/media/slides/20240605_D_Antal_CSRD_Automated_Compliance/20240605_D_Antal_CSRD_Automated_Compliance_6_hu6b783c6b8e4188d05bc95a3713e01010_165927_62d9a4dd919c7ad85880c8c5da5c7c62.webp 760w,
/media/slides/20240605_D_Antal_CSRD_Automated_Compliance/20240605_D_Antal_CSRD_Automated_Compliance_6_hu6b783c6b8e4188d05bc95a3713e01010_165927_1200x1200_fit_q75_h2_lanczos_3.webp 1200w"
src="https://danielantal.eu/media/slides/20240605_D_Antal_CSRD_Automated_Compliance/20240605_D_Antal_CSRD_Automated_Compliance_6_hu6b783c6b8e4188d05bc95a3713e01010_165927_1d1b9767c6a5bd995bc787e3bca443c0.webp"
width="760"
height="428"
loading="lazy" data-zoomable />&lt;/div>
&lt;/div>&lt;figcaption>
&lt;a href="https://danielantal.eu/slides/20240605_d_antal_csrd_automated_compliance/">All slides&lt;/a>
&lt;/figcaption>&lt;/figure>&lt;/td>
&lt;p>Reprex is building a data sharing system, Reprexbase, which is built around Wikibase as a knowledge broker system. Wikibase is the open-source software that hosts Wikidata, the largest open knowledge graph in the world. We extend it with various ETL modules and an ecosystem of peer-reviewed statistical libraries to create scientifically correct impact indicators and benchmarks. These extensions are necessary not only because most enterprises are not familiar with working with linked data or graphs but also because Companies are usually familiar with creating financial indicators (for SMEs, simple accounting indicators, larger enterprises with a controlling function, and more complex indicators) but not with the creation of non-financial statistical indicators. The CSRD directive calls for reporting more than 200 indicators over five environmental and four social matter groups, which is more than a company would have on a balanced scorecard. Reliably producing so many indicators is no small feat.&lt;/p>
&lt;p>Large and public companies are directly responsible for CSRD reporting, i.e., integrating financial, environmental and social accounting. They need to improve their bookkeeping or ERP systems because often they do not even organise into a database much of the contents of the current invoices (physical quantities, units), which would allow them to create the factual basis between energy cost, energy use and energy footprint, for example. However, they must ask their suppliers for further environmental and social data. Without readily applicable Digital Product Passports, this is a considerable cost, estimated at around 1500 euros per supplier. If you are a festival organiser or a film producer with hundreds of suppliers, this will quickly pay for a simple data sharing space; if you do it with a cluster in your industry, the costs can be significantly reduced.&lt;/p></description></item></channel></rss>