<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>open-data | Antal Dániel honlapja</title><link>https://danielantal.eu/hu/tag/open-data/</link><atom:link href="https://danielantal.eu/hu/tag/open-data/index.xml" rel="self" type="application/rss+xml"/><description>open-data</description><generator>Wowchemy (https://wowchemy.com)</generator><language>hu</language><lastBuildDate>Tue, 29 Apr 2025 00:00:00 +0000</lastBuildDate><image><url>https://danielantal.eu/media/icon_hub9491570ac57158c0eeecc95c95b13e5_20247_512x512_fill_lanczos_center_3.png</url><title>open-data</title><link>https://danielantal.eu/hu/tag/open-data/</link></image><item><title>Open Music Registers</title><link>https://danielantal.eu/hu/publication/2025_open_music_registers/</link><pubDate>Tue, 29 Apr 2025 00:00:00 +0000</pubDate><guid>https://danielantal.eu/hu/publication/2025_open_music_registers/</guid><description>&lt;h2 id="about-this-release">About this Release&lt;/h2>
&lt;p>This technical paper is part of the &lt;strong>Open Music Observatory&lt;/strong> under the Horizon Europe &lt;em>Open Music Europe&lt;/em> project.&lt;br>
It presents an early framework for federated music registers and demonstrates how they can support &lt;strong>rights management, cultural statistics, and business innovation&lt;/strong>.&lt;/p>
&lt;p>The current edition describes the design principles and pilot implementations.&lt;br>
Future editions will extend the model with more data partners, stress-tested pipelines, and additional use cases.&lt;/p>
&lt;div class="alert alert-note">
&lt;div>
&lt;strong>Note:&lt;/strong> This is a &lt;strong>technical release&lt;/strong> and should be cited using the DOI: &lt;a href="https://doi.org/10.5281/zenodo.14767717" target="_blank" rel="noopener">10.5281/zenodo.14767717&lt;/a>.
&lt;/div>
&lt;/div>
&lt;h2 id="participate">Participate&lt;/h2>
&lt;p>We invite music industry partners, cultural institutions, and researchers to &lt;strong>engage with the pilot registers&lt;/strong> and help refine the model.&lt;br>
Please visit the &lt;a href="https://doi.org/10.5281/zenodo.14767717" target="_blank" rel="noopener">Zenodo record&lt;/a> or the &lt;a href="https://music.dataobservatory.eu/" target="_blank" rel="noopener">Open Music Observatory&lt;/a> for more information.&lt;/p></description></item><item><title>Big Data for All: Building Collaborative Data Observatories</title><link>https://danielantal.eu/hu/post/2022-11-03_ehv_innovation_cafe/</link><pubDate>Thu, 03 Nov 2022 17:30:00 +0000</pubDate><guid>https://danielantal.eu/hu/post/2022-11-03_ehv_innovation_cafe/</guid><description>&lt;p>Reprex&amp;rsquo;s co-founder, &lt;a href="https://danielantal.eu/authors/daniel_antal">Daniel Antal&lt;/a> talked in the &lt;a href="https://www.ehvinnovationcafe.org/past-events/" target="_blank" rel="noopener">Eindhoven Innovation Café&lt;/a> about these issues. You can watch the recorded version of the the livestream that starts at 5 minutes and 22 seconds:&lt;/p>
&lt;div style="position: relative; padding-bottom: 56.25%; height: 0; overflow: hidden;">
&lt;iframe src="https://www.youtube.com/embed/kM54gAAbHY0" style="position: absolute; top: 0; left: 0; width: 100%; height: 100%; border:0;" allowfullscreen title="YouTube Video">&lt;/iframe>
&lt;/div>
&lt;p>&lt;em>This is a past event&lt;/em>. Check out our forthcoming &lt;a href="https://danielantal.eu/#talks">events&lt;/a> or write to &lt;a href="https://www.linkedin.com/in/antaldaniel/" target="_blank" rel="noopener">
&lt;i class="fab fa-linkedin pr-1 fa-fw">&lt;/i> Daniel Antal&lt;/a> or to &lt;a href="https://keybase.io/antaldaniel" target="_blank" rel="noopener">
&lt;i class="fab fa-keybase pr-1 fa-fw">&lt;/i> antaldaniel&lt;/a>. Or send an &lt;a href="https://danielantal.eu/contact/">
&lt;i class="fas fa-envelope pr-1 fa-fw">&lt;/i> email&lt;/a>.&lt;/p>
&lt;h2 id="the-event-invitation-text-and-links">The event invitation text and links&lt;/h2>
&lt;p>&lt;code>Big data and AI creates inequalities&lt;/code>. It puts historically marginalized people, like ethnic minorities, and womxn, at a disadvantage. Because AI and checking on AI require plenty of data, usually only giant corporations, the wealthiest governments, and university entities can make it work for them. Reprex is a Hague-based, international startup that wants to impact various sustainable development goals by enabling smaller organizations to join their smaller datasets, use open data, create linked available data, and collaboratively make a change.&lt;/p>
&lt;p>Reprex is a finalist for the &lt;code>Hague Innovation Award&lt;/code> for impact startup (please 🙏, &lt;a href="https://reprex.nl/post/2022-10-29_reprex-talk-to-all/" target="_blank" rel="noopener">vote for us&lt;/a>!). Daniel Antal, one of the co-founders, will talk about their approach to building an international coalition of music organizations to pool data and challenge data monopolies using organizational techniques, a collaboration ethos, and data from the open-source developer world.&lt;/p>
&lt;p>Using the example of independent music creators, who often find themselves in a position where it is more expensive to claim their money from global platforms, he will talk about how to reduce inequalities in the world of big data and AI with collaboration on web 3.0. In the Q&amp;amp;A he will take questions on how to apply their know-how, and generally linked open data to other art+tech or creative segments or problems for which everybody is too small, like meeting the Paris Accord greenhouse gas targets bit by bit, small company by small company.&lt;/p>
&lt;h2 id="in-the-qa-we-can-discuss-many-things">In the Q&amp;amp;A, we can discuss many things&lt;/h2>
&lt;ul>
&lt;li>&lt;input checked="" disabled="" type="checkbox"> How can Reprex help an individual creator in music, or in fashion and design, or any other area?&lt;/li>
&lt;li>&lt;input checked="" disabled="" type="checkbox"> What sort of help it can give to researchers, research institutes, specialist consultancies, law firms, and other knowledge-based actors?&lt;/li>
&lt;/ul>
&lt;p>What sort of partners is &lt;a href="https://reprex.nl/" target="_blank" rel="noopener">Reprex&lt;/a> looking for in &lt;code>Eindhoven&lt;/code>?&lt;/p>
&lt;h2 id="check-out-our-projects">Check out our projects&lt;/h2>
&lt;ul>
&lt;li>&lt;input checked="" disabled="" type="checkbox"> &lt;a href="https://music.dataobservatory.eu/" target="_blank" rel="noopener">Digital Music Observatory&lt;/a> and &lt;a href="https://music.dataobservatory.eu/project/listen-local/" target="_blank" rel="noopener">Listen Local&lt;/a>&lt;/li>
&lt;li>&lt;input checked="" disabled="" type="checkbox"> &lt;a href="https://ccsi.dataobservatory.eu/" target="_blank" rel="noopener">Cultural &amp;amp; Creative Sectors and Industries Observatory&lt;/a> and short call for potential partners.&lt;/li>
&lt;li>&lt;input checked="" disabled="" type="checkbox"> &lt;a href="https://greendeal.dataobservatory.eu/" target="_blank" rel="noopener">Green Deal Data Observatory&lt;/a> and simple, connected, financial and sustainability reporting for creative enterprises and others&lt;/li>
&lt;/ul>
&lt;h2 id="reprex-the-impact-startup">Reprex: the impact startup&lt;/h2>
&lt;ul>
&lt;li>&lt;input checked="" disabled="" type="checkbox"> Check out our accomplishments since the foundation in 2020&lt;/li>
&lt;/ul></description></item><item><title>Digital Music Observatory on the MaMA Convention 2021, Paris, FR</title><link>https://danielantal.eu/hu/event/2021_10_15_mama/</link><pubDate>Thu, 14 Oct 2021 11:00:00 +0000</pubDate><guid>https://danielantal.eu/hu/event/2021_10_15_mama/</guid><description>&lt;p>Currently more than half of the global music sales are made by autonomous AI systems owned by Google, Apple, or Spotify. These data monopolies are getting rich, because they reap the profit from music businesses with an average employee count of 1.8 Europe. European music businesses are easy to exploit with armies of data engineers and data scientists because they do not have a single data scientist or even an IT function.&lt;/p>
&lt;ol>
&lt;li>
&lt;p>Artists in the UK had a difficulty explaining in Westminster how they are losing out in streaming– so we have created a streaming price index, like the Dow Jones, if you like, that explains the economic factors of the devaluation of music in the last 5 years in 20 countries. (See &lt;a href="https://music.dataobservatory.eu/publication/mce_empirical_streaming_2021/" target="_blank" rel="noopener">our report&lt;/a>.)&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Music organizations in Slovakia and Hungary were frustrated that their politicians and journalists believed music to be taxpayer funded, so we showed with data that they contribute more proportionally to the national budget than car manufacturers, the darling of local politicians (See our reports in &lt;a href="https://music.dataobservatory.eu/publication/hungary_music_industry_2014/" target="_blank" rel="noopener">Hungary&lt;/a> (recast several times) and in &lt;a href="https://music.dataobservatory.eu/publication/slovak_music_industry_2019/" target="_blank" rel="noopener">Slovakia&lt;/a>.)&lt;/p>
&lt;/li>
&lt;li>
&lt;p>We successfully challenged with data restaurant associations, hotel chains, telecom corporations and broadcasters who wanted to bring music prices down in court and via lobbying.&lt;/p>
&lt;/li>
&lt;/ol>
&lt;p>The music industry has envied the television and film industry which has a single go-to-point for data when it needs them, the European Audiovisual Observatory. It started lobbying for a publicly financed music observatory. But we did not wait. The music industry has a tragic track record of failed centralized international data projects. We built Reprex out of a 12-country, decentralized music project. We learned how to utilize hidden, but already existing data and research funds well, and how to manage the data governance among the poisonous conflicts of interests between rich and poor countries, authors vs producers, producer’s vs performers.&lt;/p>
&lt;ul>
&lt;li>
&lt;p>Our &lt;a href="https://music.dataobservatory.eu/" target="_blank" rel="noopener">Digital Music Observatory&lt;/a> is not theoretical, it is practical, because it is built around real-life court cases, damage claims, lobbying and PR arguments.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Our Digital Music Observatory is comprehensive – it contains more than a thousand indicators from all European countries. We have enough data to test the biases of the Spotify or the YouTube algorithm – you would be surprised what the data tells us.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>It has data available much sooner, in much higher quality and in a more practical format than in the Audiovisual one.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;h2 id="presentation-slides">Presentation Slides&lt;/h2>
&lt;p>You can see the presentation slides &lt;a href="https://reprex.nl/slides/mama_2021/#/" target="_blank" rel="noopener">here&lt;/a>.&lt;/p></description></item><item><title>Economic and Environment Impact Analysis, Automated for Data-as-Service</title><link>https://danielantal.eu/hu/post/2021-06-03-iotables-release/</link><pubDate>Thu, 03 Jun 2021 16:00:00 +0000</pubDate><guid>https://danielantal.eu/hu/post/2021-06-03-iotables-release/</guid><description>&lt;p>We have released a new version of
&lt;a href="https://iotables.dataobservatory.eu/" target="_blank" rel="noopener">iotables&lt;/a> as part of the
&lt;a href="http://ropengov.org/" target="_blank" rel="noopener">rOpenGov&lt;/a> project. The package, as the name
suggests, works with European symmetric input-output tables (SIOTs).
SIOTs are among the most complex governmental statistical products. They
show how each country’s 64 agricultural, industrial, service, and
sometimes household sectors relate to each other. They are estimated
from various components of the GDP, tax collection, at least every five
years.&lt;/p>
&lt;p>SIOTs offer great value to policy-makers and analysts to make more than
educated guesses on how a million euros, pounds or Czech korunas spent
on a certain sector will impact other sectors of the economy, employment
or GDP. What happens when a bank starts to give new loans and advertise
them? How is an increase in economic activity going to affect the amount
of wages paid and and where will consumers most likely spend their
wages? As the national economies begin to reopen after COVID-19 pandemic
lockdowns, is to utilize SIOTs to calculate direct and indirect
employment effects or value added effects of government grant programs
to sectors such as cultural and creative industries or actors such as
venues for performing arts, movie theaters, bars and restaurants.&lt;/p>
&lt;p>Making such calculations requires a bit of matrix algebra, and
understanding of input-output economics, direct, indirect effects, and
multipliers. Economists, grant designers, policy makers have those
skills, but until now, such calculations were either made in cumbersome
Excel sheets, or proprietary software, as the key to these calculations
is to keep vectors and matrices, which have at least one dimension of
64, perfectly aligned. We made this process reproducible with
&lt;a href="https://iotables.dataobservatory.eu/" target="_blank" rel="noopener">iotables&lt;/a> and
&lt;a href="https://CRAN.R-project.org/package=eurostat" target="_blank" rel="noopener">eurostat&lt;/a> on
&lt;a href="http://ropengov.org/" target="_blank" rel="noopener">rOpenGov&lt;/a>&lt;/p>
&lt;figure id="figure-our-iotables-package-creates-direct-indirect-effects-and-multipliers-programatically-our-observatory-will-make-those-indicators-available-for-all-european-countries">
&lt;div class="d-flex justify-content-center">
&lt;div class="w-100" >&lt;img src="https://danielantal.eu/media/img/package_screenshots/iotables_0_4_5.png" alt="Our iotables package creates direct, indirect effects and multipliers programatically. Our observatory will make those indicators available for all European countries." loading="lazy" data-zoomable />&lt;/div>
&lt;/div>&lt;figcaption data-pre="&amp;nbsp;" data-post=". ábra:&amp;nbsp;" class="numbered">
Our iotables package creates direct, indirect effects and multipliers programatically. Our observatory will make those indicators available for all European countries.
&lt;/figcaption>&lt;/figure>
&lt;h2 id="accessing-and-tidying-the-data-programmatically">Accessing and tidying the data programmatically&lt;/h2>
&lt;p>The iotables package is in a way an extension to the &lt;em>eurostat&lt;/em> R
package, which provides a programmatic access to the
&lt;a href="https://ec.europa.eu/eurostat" target="_blank" rel="noopener">Eurostat&lt;/a> data warehouse. The reason for
releasing a new package is that working with SIOTs requires plenty of
meticulous data wrangling based on various &lt;em>metadata&lt;/em> sources, apart
from actually accessing the &lt;em>data&lt;/em> itself. When working with matrix
equations, the bar is higher than with tidy data. Not only your rows and
columns must match, but their ordering must strictly conform the
quadrants of the a matrix system, including the connecting trade or tax
matrices.&lt;/p>
&lt;p>When you download a country’s SIOT table, you receive a long form data
frame, a very-very long one, which contains the matrix values and their
labels like this:&lt;/p>
&lt;pre>&lt;code>## Table naio_10_cp1700 cached at C:\Users\...\Temp\RtmpGQF4gr/eurostat/naio_10_cp1700_date_code_FF.rds
# we save it for further reference here
saveRDS(naio_10_cp1700, &amp;quot;not_included/naio_10_cp1700_date_code_FF.rds&amp;quot;)
# should you need to retrieve the large tempfiles, they are in
dir (file.path(tempdir(), &amp;quot;eurostat&amp;quot;))
dplyr::slice_head(naio_10_cp1700, n: 5)
## # A tibble: 5 x 7
## unit stk_flow induse prod_na geo time values
## &amp;lt;chr&amp;gt; &amp;lt;chr&amp;gt; &amp;lt;chr&amp;gt; &amp;lt;chr&amp;gt; &amp;lt;chr&amp;gt; &amp;lt;date&amp;gt; &amp;lt;dbl&amp;gt;
## 1 MIO_EUR DOM CPA_A01 B1G EA19 2019-01-01 141873.
## 2 MIO_EUR DOM CPA_A01 B1G EU27_2020 2019-01-01 174976.
## 3 MIO_EUR DOM CPA_A01 B1G EU28 2019-01-01 187814.
## 4 MIO_EUR DOM CPA_A01 B2A3G EA19 2019-01-01 0
## 5 MIO_EUR DOM CPA_A01 B2A3G EU27_2020 2019-01-01 0
&lt;/code>&lt;/pre>
&lt;p>The metadata reads like this: the units are in millions of euros, we are
analyzing domestic flows, and the national account items &lt;code>B1-B2&lt;/code> for the
industry &lt;code>A01&lt;/code>. The information of a 64x64 matrix (the SIOT) and its
connecting matrices, such as taxes, or employment, or &lt;em>C**O&lt;/em>&lt;sub>2&lt;/sub>
emissions, must be placed exactly in one correct ordering of columns and
rows. Every single data wrangling error will usually lead in an error
(the matrix equation has no solution), or, what is worse, in a very
difficult to trace algebraic error. Our package not only labels this
data meaningfully, but creates very tidy data frames that contain each
necessary matrix of vector with a key column.&lt;/p>
&lt;p>iotables package contains the vocabularies (abbreviations and human
readable labels) of three statistical vocabularies: the so called
&lt;code>COICOP&lt;/code> product codes, the &lt;code>NACE&lt;/code> industry codes, and the vocabulary of
the &lt;code>ESA2010&lt;/code> definition of national accounts (which is the government
equivalent of corporate accounting).&lt;/p>
&lt;p>Our package currently solves all equations for direct, indirect effects,
multipliers and inter-industry linkages. Backward linkages show what
happens with the suppliers of an industry, such as catering or
advertising in the case of music festivals, if the festivals reopen. The
forward linkages show how much extra demand this creates for connecting
services that treat festivals as a ‘supplier’, such as cultural tourism.&lt;/p>
&lt;h2 id="lets-seen-an-example">Let’s seen an example&lt;/h2>
&lt;pre>&lt;code>## Downloading employment data from the Eurostat database.
## Table lfsq_egan22d cached at C:\Users\...\Temp\RtmpGQF4gr/eurostat/lfsq_egan22d_date_code_FF.rds
&lt;/code>&lt;/pre>
&lt;p>and match it with the latest structural information on from the
&lt;a href="http://appsso.eurostat.ec.europa.eu/nui/show.do?wai=true&amp;amp;dataset=naio_10_cp1700" target="_blank" rel="noopener">Symmetric input-output table at basic prices (product by
product)&lt;/a>
Eurostat product. A quick look at the Eurostat website already shows
that there is a lot of work ahead to make the data look like an actual
Symmetric input-output table. Download it with &lt;code>iotable_get()&lt;/code> which
does basic labelling and preprocessing on the raw Eurostat files.
Because of the size of the unfiltered dataset on Eurostat, the following
code may take several minutes to run.&lt;/p>
&lt;pre>&lt;code>sk_io &amp;lt;- iotable_get ( labelled_io_data: NULL,
source: &amp;quot;naio_10_cp1700&amp;quot;, geo: &amp;quot;SK&amp;quot;,
year: 2015, unit: &amp;quot;MIO_EUR&amp;quot;,
stk_flow: &amp;quot;TOTAL&amp;quot;,
labelling: &amp;quot;iotables&amp;quot; )
## Reading cache file C:\Users\..\Temp\RtmpGQF4gr/eurostat/naio_10_cp1700_date_code_FF.rds
## Table naio_10_cp1700 read from cache file: C:\Users\..\Temp\RtmpGQF4gr/eurostat/naio_10_cp1700_date_code_FF.rds
## Saving 808 input-output tables into the temporary directory
## C:\Users\...\Temp\RtmpGQF4gr
## Saved the raw data of this table type in temporary directory C:\Users\...\Temp\RtmpGQF4gr/naio_10_cp1700.rds.
&lt;/code>&lt;/pre>
&lt;p>The &lt;code>input_coefficient_matrix_create()&lt;/code> creates the input coefficient
matrix, which is used for most of the analytical functions.&lt;/p>
&lt;p>&lt;em>a&lt;/em>&lt;sub>&lt;em>i**j&lt;/em>&lt;/sub>: &lt;em>X&lt;/em>&lt;sub>&lt;em>i**j&lt;/em>&lt;/sub> / &lt;em>x&lt;/em>&lt;sub>&lt;em>j&lt;/em>&lt;/sub>&lt;/p>
&lt;p>It checks the correct ordering of columns, and furthermore it fills up 0
values with 0.000001 to avoid division with zero.&lt;/p>
&lt;pre>&lt;code>input_coeff_matrix_sk &amp;lt;- input_coefficient_matrix_create(
data_table: sk_io
)
## Columns and rows of real_estate_imputed_a, extraterriorial_organizations are all zeros and will be removed.
&lt;/code>&lt;/pre>
&lt;p>Then you can create the Leontieff-inverse, which contains all the
structural information about the relationships of 64x64 sectors of the
chosen country, in this case, Slovakia, ready for the main equations of
input-output economics.&lt;/p>
&lt;pre>&lt;code>I_sk &amp;lt;- leontieff_inverse_create(input_coeff_matrix_sk)
&lt;/code>&lt;/pre>
&lt;p>And take out the primary inputs:&lt;/p>
&lt;pre>&lt;code>primary_inputs_sk &amp;lt;- coefficient_matrix_create(
data_table: sk_io,
total: 'output',
return: 'primary_inputs')
## Columns and rows of real_estate_imputed_a, extraterriorial_organizations are all zeros and will be removed.
&lt;/code>&lt;/pre>
&lt;p>Now let’s see if there the government tries to stimulate the economy in
three sectors, agricultulre, car manufacturing, and R&amp;amp;D with a billion
euros. Direct effects measure the initial, direct impact of the change
in demand and supply for a product. When production goes up, it will
create demand in all supply industries (backward linkages) and create
opportunities in the industries that use the product themselves (forward
linkages.)&lt;/p>
&lt;pre>&lt;code>direct_effects_create( primary_inputs_sk, I_sk ) %&amp;gt;%
select ( all_of(c(&amp;quot;iotables_row&amp;quot;, &amp;quot;agriculture&amp;quot;,
&amp;quot;motor_vechicles&amp;quot;, &amp;quot;research_development&amp;quot;))) %&amp;gt;%
filter (.data$iotables_row %in% c(&amp;quot;gva_effect&amp;quot;, &amp;quot;wages_salaries_effect&amp;quot;,
&amp;quot;imports_effect&amp;quot;, &amp;quot;output_effect&amp;quot;))
## iotables_row agriculture motor_vechicles research_development
## 1 imports_effect 1.3684350 2.3028203 0.9764921
## 2 wages_salaries_effect 0.2713804 0.3183523 0.3828014
## 3 gva_effect 0.9669621 0.9790771 0.9669467
## 4 output_effect 2.2876287 3.9840251 2.2579634
&lt;/code>&lt;/pre>
&lt;p>Car manufacturing requires much imported components, so each extra
demand will create a large importing activity. The R&amp;amp;D will create a the
most local wages (and supports most jobs) because research is
job-intensive. As we can see, the effect on imports, wages, gross value
added (which will end up in the GDP) and output changes are very
different in these three sectors.&lt;/p>
&lt;p>This is not the total effect, because some of the increased production
will translate into income, which in turn will be used to create further
demand in all parts of the domestic economy. The total effect is
characterized by multipliers.&lt;/p>
&lt;p>Then solve for the multipliers:&lt;/p>
&lt;pre>&lt;code>multipliers_sk &amp;lt;- input_multipliers_create(
primary_inputs_sk %&amp;gt;%
filter (.data$iotables_row == &amp;quot;gva&amp;quot;), I_sk )
&lt;/code>&lt;/pre>
&lt;p>And select a few industries:&lt;/p>
&lt;pre>&lt;code>set.seed(12)
multipliers_sk %&amp;gt;%
tidyr::pivot_longer ( -all_of(&amp;quot;iotables_row&amp;quot;),
names_to: &amp;quot;industry&amp;quot;,
values_to: &amp;quot;GVA_multiplier&amp;quot;) %&amp;gt;%
select (-all_of(&amp;quot;iotables_row&amp;quot;)) %&amp;gt;%
arrange( -.data$GVA_multiplier) %&amp;gt;%
dplyr::sample_n(8)
## # A tibble: 8 x 2
## industry GVA_multiplier
## &amp;lt;chr&amp;gt; &amp;lt;dbl&amp;gt;
## 1 motor_vechicles 7.81
## 2 wood_products 2.27
## 3 mineral_products 2.83
## 4 human_health 1.53
## 5 post_courier 2.23
## 6 sewage 1.82
## 7 basic_metals 4.16
## 8 real_estate_services_b 1.48
&lt;/code>&lt;/pre>
&lt;h2 id="vignettes">Vignettes&lt;/h2>
&lt;p>The &lt;a href="https://iotables.dataobservatory.eu/articles/germany_1990.html" target="_blank" rel="noopener">Germany
1990&lt;/a>
provides an introduction of input-output economics and re-creates the
examples of the &lt;a href="https://iotables.dataobservatory.eu/articles/germany_1990.html" target="_blank" rel="noopener">Eurostat Manual of Supply, Use and Input-Output
Tables&lt;/a>,
by Jörg Beutel (Eurostat Manual).&lt;/p>
&lt;p>The &lt;a href="https://iotables.dataobservatory.eu/articles/united_kingdom_2010.html" target="_blank" rel="noopener">United Kingdom Input-Output Analytical Tables Daniel Antal, based
on the work edited by Richard
Wild&lt;/a>
is a use case on how to correctly import data from outside Eurostat
(i.e. not with &lt;code>eurostat::get_eurostat()&lt;/code>) and join it properly to a
SIOT. We also used this example to create unit tests of our functions
from a published, official government statistical release.&lt;/p>
&lt;p>Finally, &lt;a href="https://iotables.dataobservatory.eu/articles/working_with_eurostat.html" target="_blank" rel="noopener">Working With Eurostat
Data&lt;/a>
is a detailed use case of working with all the current functionalities
of the package by comparing two economies, Czechia and Slovakia and
guides you through a lot more examples than this short blogpost.&lt;/p>
&lt;p>Our package was originally developed to calculate GVA and employment
effects for the Slovak music industry (see our &lt;a href="https://music.dataobservatory.eu/publication/slovak_music_industry_2019/" target="_blank" rel="noopener">Slovak Music Industry Report&lt;/a>), and similar calculations for the
Hungarian film tax shelter. We can now programatically create
reproducible multipliers for all European economies in the &lt;a href="https://music.dataobservatory.eu/" target="_blank" rel="noopener">Digital
Music Observatory&lt;/a>, and create
further indicators for economic policy making in the &lt;a href="https://economy.dataobservatory.eu/" target="_blank" rel="noopener">Economy Data
Observatory&lt;/a>.&lt;/p>
&lt;h2 id="environmental-impact-analysis">Environmental Impact Analysis&lt;/h2>
&lt;p>Our package allows the calculation of various economic policy scenarios,
such as changing the VAT on meat or effects of re-opening music
festivals on aggregate demand, GDP, tax revenues, or employment. But
what about the &lt;em>C**O&lt;/em>&lt;sub>2&lt;/sub>, methane and other greenhouse gas
effects of the reopening festivals, or the increasing meat prices?&lt;/p>
&lt;p>Technically our package can already calculate such effects, but to do
so, you have to carefully match further statistical vocabulary items
used by the European Environmental Agency about air pollutants and
greenhouse gases.&lt;/p>
&lt;p>The last released version of &lt;em>iotables&lt;/em> is Importing and Manipulating
Symmetric Input-Output Tables (Version 0.4.4). Zenodo.
&lt;a href="https://zenodo.org/record/4897472" target="_blank" rel="noopener">https://doi.org/10.5281/zenodo.4897472&lt;/a>,
but we are alread working on a new major release. In that release, we
are planning to build in the necessary vocabulary into the metadata
functions to increase the functionality of the package, and create new
indicators for our &lt;a href="https://greendeal.dataobservatory.eu/" target="_blank" rel="noopener">Green Deal Data
Observatory&lt;/a>. This experimental
data observatory is creating new, high quality statistical indicators
from open governmental and open science data sources that has not seen
the daylight yet.&lt;/p>
&lt;h2 id="ropengov-and-the-eu-datathon-challenges">rOpenGov and the EU Datathon Challenges&lt;/h2>
&lt;figure id="figure-ropengov-reprex-and-other-open-collaboration-partners-teamed-up-to-build-on-our-expertise-of-open-source-statistical-software-development-further-we-want-to-create-a-technologically-and-financially-feasible-data-as-service-to-put-our-reproducible-research-products-into-wider-user-for-the-business-analyst-scientific-researcher-and-evidence-based-policy-design-communities">
&lt;div class="d-flex justify-content-center">
&lt;div class="w-100" >&lt;img src="https://danielantal.eu/media/img/partners/rOpenGov-intro.png" alt="rOpenGov, Reprex, and other open collaboration partners teamed up to build on our expertise of open source statistical software development further: we want to create a technologically and financially feasible data-as-service to put our reproducible research products into wider user for the business analyst, scientific researcher and evidence-based policy design communities." loading="lazy" data-zoomable />&lt;/div>
&lt;/div>&lt;figcaption data-pre="&amp;nbsp;" data-post=". ábra:&amp;nbsp;" class="numbered">
rOpenGov, Reprex, and other open collaboration partners teamed up to build on our expertise of open source statistical software development further: we want to create a technologically and financially feasible data-as-service to put our reproducible research products into wider user for the business analyst, scientific researcher and evidence-based policy design communities.
&lt;/figcaption>&lt;/figure>
&lt;p>&lt;a href="http://ropengov.org/" target="_blank" rel="noopener">rOpenGov&lt;/a> is a community of open governmental
data and statistics developers with many packages that make programmatic
access and work with open data possible in the R language.
&lt;a href="https://reprex.nl/" target="_blank" rel="noopener">Reprex&lt;/a> is a Dutch-startup that teamed up with
rOpenGov and other open collaboration partners to create a
technologically and financially feasible service to exploit reproducible
research products for the wider business, scientific and evidence-based
policy design community. Open data is a legal concept - it means that
you have the rigth to reuse the data, but often the reuse requires
significant programming and statistical know-how. We entered into the
annual &lt;a href="https://reprex.nl/project/eu-datathon_2021/" target="_blank" rel="noopener">EU Datathon&lt;/a>
competition in all three challenges with our applications to not only
provide open-source software, but daily updated, validated, documented,
high-quality statistical indicators as open data in an open database.
Our &lt;a href="https://iotables.dataobservatory.eu/" target="_blank" rel="noopener">iotables&lt;/a> package is one of
our many open-source building blocks to make open data more accessible
to all.&lt;/p>
&lt;p>&lt;em>Join our open collaboration Digital Music Observatory team as a &lt;a href="https://music.dataobservatory.eu/authors/curator" target="_blank" rel="noopener">data curator&lt;/a>, &lt;a href="https://music.dataobservatory.eu/authors/developer" target="_blank" rel="noopener">developer&lt;/a> or &lt;a href="https://music.dataobservatory.eu/authors/team" target="_blank" rel="noopener">business developer&lt;/a>. More interested in environmental impact analysis? Try our &lt;a href="https://greendeal.dataobservatory.eu/#contributors" target="_blank" rel="noopener">Green Deal Data Observatory&lt;/a> team! Or economic policies, particularly computation antitrust, innovation and small enterprises? Check out our &lt;a href="https://economy.dataobservatory.eu/#contributors" target="_blank" rel="noopener">Economy Music Observatory&lt;/a> team!&lt;/em>&lt;/p></description></item><item><title>Reprex Open Data Day 2021</title><link>https://danielantal.eu/hu/event/2021_03_06_odd/</link><pubDate>Sat, 06 Mar 2021 15:30:00 +0200</pubDate><guid>https://danielantal.eu/hu/event/2021_03_06_odd/</guid><description>&lt;p>&lt;a href="https://opendataday.org/" target="_blank" rel="noopener">Open Data Day&lt;/a> is an annual celebration of open data all over the world. It is an opportunity to show the benefits of open data and encourage the adoption of open data policies in government, business, and civil society. Reprex is a start-up that utilizes open data with open-source reproducible research: please challenge us with your data requests and participate in our web events.&lt;/p>
&lt;p>The &lt;code>Reprex Open Data Day 2021&lt;/code> will be two informal conversations based on a series of run up introductory blogposts centered around two themes. Because important guests became ill in the last days, we are going to consolidate the two talks into one with less structure. We want to create an informal, inclusive, collaborative online event on International Open Data Day 2021. Please, grab a tea, coffee, or even a beer, and join us for an informal conversation. We hope that we will finish the afternoon with ideas on new, open-data driven collaborations.&lt;/p>
&lt;p>&lt;code>9.30 EST / 15.30 CET&lt;/code>: &lt;strong>Open collaboration in business, policy and science.&lt;/strong> Creating evidence-based policy, business strategy or scientific research with small contributions with independent components with incentives. Short introduction with examples: joining environmental sensory data and public opinion data on maps; creating harmonized datasets across the Arab world. Survey harmonization, mapping, data products. &lt;strong>Scaling up open collaboration: making small organizations competitive with big tech in the big data era.&lt;/strong> Data sharing, data pooling, data altruism and observatories. The new European trustworthy AI and data governance agenda.&lt;/p>
&lt;p>You can &lt;a href="https://danielantal.eu/presentations/reprex_open_data_day_2021.html#/reprex">click through&lt;/a> a short presentation to familiarize yourself with our topics.&lt;/p>
&lt;p>See you &lt;a href="https://meet.jit.si/ReprexOpenDataDay2021" target="_blank" rel="noopener">here&lt;/a>.&lt;/p>
&lt;p>&lt;strong>Case studies:&lt;/strong>&lt;/p>
&lt;ol>
&lt;li>
&lt;p>We are connecting raw survey data about Climate Awareness in Eurobarometer surveys. Here is the &lt;a href="https://rpubs.com/antaldaniel/734594" target="_blank" rel="noopener">reproduction code&lt;/a> (&lt;em>intermediate to advanced R needed&lt;/em>.) You should use the &lt;em>development&lt;/em> version of our &lt;a href="retroharmonize.dataobservatory.eu">retroharmonize&lt;/a> package at &lt;a href="https://github.com/antaldaniel/retroharmonize" target="_blank" rel="noopener">github.com/antaldaniel/retroharmonize&lt;/a>&lt;/p>
&lt;/li>
&lt;li>
&lt;p>We are tracking changes in the boundaries of provinces, states, counties, parishes with our regions open source software &amp;ndash; &lt;a href="https://rpubs.com/antaldaniel/regions-OOD21" target="_blank" rel="noopener">reproduction code here&lt;/a>. You will need our &lt;a href="regions.dataobservatory.eu">regions&lt;/a> package which is available on CRAN or in the rOpenGov GitHub repo.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>We will talk about how to join this with air pollution data and put it on the map with &lt;a href="https://dataandlyrics.com/post/2021-03-03-ood_interview_maps/" target="_blank" rel="noopener">Milos Popovic&lt;/a>, who prepared this nice choropleth animation.&lt;/p>
&lt;/li>
&lt;/ol>
&lt;p>
&lt;figure >
&lt;div class="d-flex justify-content-center">
&lt;div class="w-100" >&lt;img src="https://danielantal.eu/media/gif/eu_climate_change.gif" alt="Milos Popovic&amp;amp;rsquo;s maps made from the case study." loading="lazy" data-zoomable />&lt;/div>
&lt;/div>&lt;/figure>
&lt;/p>
&lt;ol start="4">
&lt;li>We will discuss data observatories (permanent data collection programs), open collaboration (open-source inspired way of cooperation among small and large independent actors) and data altruism.&lt;/li>
&lt;/ol>
&lt;p>Any questions: send Daniel a message on &lt;a href="https://keybase.io/antaldaniel" target="_blank" rel="noopener">Keybase&lt;/a>, Whatsapp or &lt;a href="https://dataandlyrics.com/#contact" target="_blank" rel="noopener">email&lt;/a>.&lt;/p></description></item><item><title>Reprex introduction in IVIR, Amsterdam, NL</title><link>https://danielantal.eu/hu/event/2021_04_09_ivirtual/</link><pubDate>Tue, 02 Feb 2021 10:10:00 +0000</pubDate><guid>https://danielantal.eu/hu/event/2021_04_09_ivirtual/</guid><description>&lt;p>IViRtual 9 April 2021&lt;/p></description></item><item><title>Product/Market Fit Validation in Yes!Delft</title><link>https://danielantal.eu/hu/post/2020-09-25-yesdelft-validation/</link><pubDate>Fri, 25 Sep 2020 15:31:39 +0000</pubDate><guid>https://danielantal.eu/hu/post/2020-09-25-yesdelft-validation/</guid><description>&lt;p>We would like to validate our product market/fit in two segments, business/policy research and scientific research, with a supporting role given to data journalism. Because we want to follow a bootstrapping strategy, we must focus on those clients where we find the highest value proposition, which is of course easier said than done. We see much interest in our offering from other continents, therefore we truly welcome the opportunity that we can do this on a truly global business canvas in one of the worlds’ &lt;a href="https://www.yesdelft.com/news/yesdelft-among-the-top-5-business-incubators-in-the-world/" target="_blank" rel="noopener">top five incubators&lt;/a>, the number 2 university-backed incubator in the world, second to none in Europe, in the &lt;a href="https://www.yesdelft.com/focus-areas/artificial-intelligence/" target="_blank" rel="noopener">Yes!Delft AI+Blockchain&lt;/a> Validation Lab.&lt;/p>
&lt;p>In Europe hundreds of thousands of microenterprises, such as record labels, video producers or book publishers are facing data and AI giants like Google’s YouTube, Apple Music, Spotify, Netflix or Amazon. If the recommendation engines of these giants do not recommend their songs, films or books, then their investments are doomed to fail, because about half of the global sales are driven by AI algorithms. When they make a claim for the missing money, they will immediately find themselves in a dispute with gigabytes of data that they can only handle with a data scientist, even though they do not even have an IT professional or an HR professional to make the hire.&lt;/p>
&lt;p>An awful lot of money, creativity and real values are at stake, and we want to be on the creator’s side, their technician’s side, their manager’s side when they want to get a fair share from the pie and they want to help these industry leader to make the pie grow.&lt;/p>
&lt;p>The &lt;a href="http://www.unesco.org/new/en/culture/themes/creativity/arts-education/research-cooperation/observatories/" target="_blank" rel="noopener">UNESCO&lt;/a> and the EU have been promoting as an organizational solution the fragmentation problem with the so-called data observatories that are pooling the business, policy, and scientific research needs of various domains, like music. This is an idea that we really like, and we believe that our research automation solutions can help these observatories to grow faster as ecosystems, create better quality and more timely data and research products and a far lower cost.&lt;/p>
&lt;p>We define ourselves as a reproducible research company inspired by the philosophy of open collaboration, based on open-source software and open data. We want to explore various revenue models around these ideas.&lt;/p>
&lt;ol>
&lt;li>
&lt;p>We are not committed to open source licensing if more permissive licensing policies provide us with better opportunities.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>We would like to explore various data-as-service models, because we do not want to be locked into the position of cheap open data vendors.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>We want to deploy AI applications that really help earning money in these sectors with playlisting, recommendation engines, forecasting applications, or royalty valuations, because our open collaboration approach brings up enough data sooner to than its alternatives, because it manages inherent conflicts of interests, fragmentation, and decentralization better than hierarchical solutions.&lt;/p>
&lt;/li>
&lt;/ol>
&lt;p>&lt;strong>Timeline&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>
&lt;p>In January CEEMID reached its peak: we introduced a 12-country &lt;a href="https://dataobservatory.eu/post/2020-01-30-ceereport/" target="_blank" rel="noopener">reproducible research project&lt;/a> made with only freelancers in Brussels, presented as best use case of evidence-based policy design.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>In February Daniel visited the &lt;a href="https://dataobservatory.eu/post/yes-delft-co-lab/" target="_blank" rel="noopener">Yes!Delft Co-Lab&lt;/a> to find out who would be the best co-founder to re-launch CEEMID as an enterprise.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>In April we started to &lt;a href="https://dataobservatory.eu/post/2020-04-16-regional-opendata-release/" target="_blank" rel="noopener">release our data&lt;/a> as open data for validation.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>One month ago we &lt;a href="https://dataobservatory.eu/post/2020-08-24-start-up/" target="_blank" rel="noopener">started-up&lt;/a>.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Then we launched the &lt;a href="https://music.dataobservatory.eu/" target="_blank" rel="noopener">music.dataobservatory.eu&lt;/a> project.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>A few other &lt;a href="https://music.dataobservatory.eu/annex.html#other-observatories" target="_blank" rel="noopener">data observatories&lt;/a>.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>Bonus:&lt;/p>
&lt;ul>
&lt;li>&lt;a href="https://www.palato.nl/" target="_blank" rel="noopener">Palato&lt;/a> in the Hague, where we took our selfie and had an absolutely amazing dinner after the pitch. Check them out!&lt;/li>
&lt;/ul></description></item><item><title>Reproducible Survey Harmonization: retroharmonize Is Released</title><link>https://danielantal.eu/hu/post/2020-09-21-retroharmonize_release/</link><pubDate>Mon, 21 Sep 2020 11:31:39 +0000</pubDate><guid>https://danielantal.eu/hu/post/2020-09-21-retroharmonize_release/</guid><description>&lt;p>Our original intention was to make surveying more accessible for music and creative industry partners, by relying more on already existing survey data, and better designing complementary, smaller surveys, becasue surveying, opinion polling is becoming increasingly expensive in the develop world. People are less and less likely to sit down for an interview in their houses. We have tried to harmonize our custom surveys, particuarly with Kantar in Hungary and Focus in Slovakia with exisiting EU projects. But we ended up making a part of international survey harmonization across countries and throughout years easier to automate.&lt;/p>
&lt;p>
&lt;figure >
&lt;div class="d-flex justify-content-center">
&lt;div class="w-100" >&lt;img src="https://danielantal.eu/img/packages/ab_plot1.png" alt="Harmonized results from Afrobarometer" loading="lazy" data-zoomable />&lt;/div>
&lt;/div>&lt;/figure>
&lt;/p>
&lt;p>Surveys are like sensors for natural sciences and industrial production. They are essential for almost any social and economic statistical indicator, for calculating the inflation, parts of the GDP, participation in education programs. Making surveys easier to harmonize and exploit more already existing survey data can bring down research cost, and can increase research value at the same time. (See our earlier blog post &lt;a href="https://dataobservatory.eu/post/2020-07-10-retroharmonize/" target="_blank" rel="noopener">Increase The Value Of Market Research With Open Data And Survey Harmonization&lt;/a>.)&lt;/p>
&lt;p>So, if you are an R user, you can use &lt;code>install.packages(“retroharmonize”)&lt;/code> to get the released 0.1.13 version and make tutorials with real Eurobarometer or Afrobarometer microdata. With &lt;code>devtools::install_github(&amp;quot;antaldaniel/retroharmonize&amp;quot;)&lt;/code> you can already install the current development version 0.1.14, which handles perl-like regex, which will be necessary for our next tutorial in the making for &lt;a href="https://www.arabbarometer.org/" target="_blank" rel="noopener">Arab Barometer&lt;/a>.&lt;/p>
&lt;p>&lt;strong>Related&lt;/strong>:&lt;/p>
&lt;ul>
&lt;li>
&lt;p>&lt;a href="https://retroharmonize.dataobservatory.eu/" target="_blank" rel="noopener">retroharmonize package website&lt;/a>&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;a href="https://github.com/antaldaniel/retroharmonize/" target="_blank" rel="noopener">retroharmonize on github&lt;/a>&lt;/p>
&lt;/li>
&lt;/ul></description></item><item><title>Launching Our Demo Music Observatory</title><link>https://danielantal.eu/hu/post/2020-09-15-music-observatory-launch/</link><pubDate>Tue, 15 Sep 2020 08:00:39 +0000</pubDate><guid>https://danielantal.eu/hu/post/2020-09-15-music-observatory-launch/</guid><description>&lt;p>Today, on 15 September 2020, we officially launched our &lt;code>minimal viable product&lt;/code> as we promised to partners back in February. This was a particularly difficult period for everybody. We aspired to deliver by September in a very different environment, our hopes for commissioned work went up in flames with the pandemic, and our targeted users, musicians and music entrepreneurs, talent managers, music venues lost most of their income. The organizations helping them, granting authorities, export offices and collective management societies are overwhelmed with the problem. During these troublesome times, our team expanded, attracted great new talent, and kept working.&lt;/p>
&lt;p>Our first product is the &lt;a href="https://music.dataobservatory.eu/" target="_blank" rel="noopener">Demo Music Observatory&lt;/a>, a collaborative, automated research-based &lt;a href="https://dataobservatory.eu/faq/observatories/" target="_blank" rel="noopener">observatory&lt;/a> for the music industry, one that is particularly hard hit by the COVID19 crisis. Not only great artists, composers, technicians, managers fell victim to the virus, but musicians lost about 50–90% of their income from live music. This translates to a 100% loss for the live music technicians and managers.&lt;/p>
&lt;p>
&lt;div style="position: relative; padding-bottom: 56.25%; height: 0; overflow: hidden;">
&lt;iframe src="https://www.youtube.com/embed/fQJHflWPS34" style="position: absolute; top: 0; left: 0; width: 100%; height: 100%; border:0;" allowfullscreen title="YouTube Video">&lt;/iframe>
&lt;/div>
See our &lt;a href="https://dataobservatory.eu/post/2020-09-11-creating-automated-observatory/" target="_blank" rel="noopener">earlier blogpost&lt;/a> on what you see on the video.&lt;/p>
&lt;p>The music industry was never a place for great job security. For putting up a show, you usually need a network of 10–200 artists, technicians and managers to work together as freelancers without all those social benefits that many people enjoy in other walks of life. We have been trying to figure out how to help this microenterprise and freelancer-network based industry with research for five years. Our aim is to make them competitive when they are talking with their buyers: Google, Apple, Spotify, who are really heavy-weight data and AI pros. Our better plan their tours, when they will be back on the road, to understand what sort of audiences and purchasing power waits for them in different European cities.&lt;/p>
&lt;p>We are launching at a time when the music industry is crying for help.Therefore, we have decided to make our demo observatory open and unfinished. Over the last 7 years, we have built up about 2000 music and creative sector indicators to be used for business KPIs, forecasting targets, grant evaluations, royalty valuations, concert demography target group analysis and other professional uses. We would like to open up, based on your needs, about 50 well-designed indicators, and pledge to keep it daily refreshed, corrected, documented, citaable, downloadable. Also, feel free to use our most valuable source code—use it for your own purposes, even modify it, as long as you keep it open.&lt;/p>
&lt;p>For our smaller partners, we follow what musicians do these days on Bandcamp: name your price. We make a pledge to our small partners: if you need reliable data to plan your next grant calls, calculate royalties, compensations, predict hit candidates, give us the job—and name your price. Post-corona, you can take for a dollar the best music from Bandcamp. You can take our research products, for a limited period, for any amount you name, as long as it is for a good cause and serves the industry, musicians, technicians or managers. In return, we ask for your feedback. Help us validate whether we are on the right track, tell us how we can cooperate after the pandemic, in better times.&lt;/p>
&lt;p>Our larger and better funded partners? We ask you to pay the price we name, because we believe that it is a well-justified, fair and competitive price, set by pricing experts.&lt;/p>
&lt;p>We appreciate it if you take a look at our offering, or if you pass this blogpost on to your colleagues in the industry. Our main target audience initially are music professional in broader Europe, but we are planning to cover all major global markets very soon, too. Feedback from the U.S., Australia, Canada, Colombia, Brazil &amp;amp; Argentina is particularly welcome as we have great plans over there!&lt;/p>
&lt;h2 id="who-we-are">Who we are?&lt;/h2>
&lt;p>We &lt;a href="https://dataobservatory.eu/post/2020-08-24-start-up/" target="_blank" rel="noopener">started&lt;/a> our operations on 1 September 2020 on the basis of &lt;a href="http://documentation.ceemid.eu/" target="_blank" rel="noopener">CEEMID&lt;/a>, a pan-European data observatory that created about 2000 music and creative industry indicators for its users. In the coming days, we are gradually opening up about 50 &lt;a href="https://music.dataobservatory.eu/" target="_blank" rel="noopener">music industry&lt;/a> and 50 broader creative industry indicators in a fully reproducible workflow, with daily re-freshed, re-processed, well-formatted and documented indicators for business and policy decisions.&lt;/p>
&lt;p>We would like to validate this approach in one of the world&amp;rsquo;s most prestigious university-backed incubator programs, in the &lt;a href="https://www.yesdelft.com/yes-programs/ai-blockchain-validation-lab/" target="_blank" rel="noopener">Yes!Delft AI/Blockchain Validation Lab&lt;/a>. We&amp;rsquo;re finalist on their selection, and all help before 23 September from our friends in the music industry is more than appreciated. If we get there, we can rely on probably the best pros in Europe to make our offering better tailored and financially sustainable.&lt;/p>
&lt;h2 id="get-in-touch">Get in touch!&lt;/h2>
&lt;p>We use the very simple and extremely secure &lt;strong>keybase.io&lt;/strong>, a kind of mix of Whatsapp, Skype, Google Drive, One Drive and zoom. You can get in touch on that platform with us in anytime &lt;a href="https://keybase.io/team/reprexcommunity" target="_blank" rel="noopener">here&lt;/a>.&lt;/p>
&lt;p>You can easily contact on LinkedIn &lt;a href="https://www.linkedin.com/in/antaldaniel/" target="_blank" rel="noopener">Daniel&lt;/a> or &lt;a href="https://www.linkedin.com/in/k%C3%A1tya-nagy-a9447730/" target="_blank" rel="noopener">Kátya&lt;/a> and of course, we have a usually working &lt;a href="https://dataobservatory.eu/#about" target="_blank" rel="noopener">email contact form&lt;/a>, too. Our email is name.surname at our main domain.&lt;/p>
&lt;h2 id="video-credits">Video credits&lt;/h2>
&lt;ul>
&lt;li>Data acquisition and processing: Daniel Antal, CFA and Marta Kołczyńska, PhD (&lt;a href="https://music.dataobservatory.eu/economy.html#demand" target="_blank" rel="noopener">survey data&lt;/a>).&lt;/li>
&lt;li>Documentation automation: Sandor Budai&lt;/li>
&lt;li>Video art: Line Matson&lt;/li>
&lt;li>Music: &lt;a href="https://www.youtube.com/moonmoonmoon" target="_blank" rel="noopener">Moon Moon Moon&lt;/a>.&lt;/li>
&lt;/ul></description></item><item><title>Creating An Automated Data Observatory</title><link>https://danielantal.eu/hu/post/2020-09-11-creating-automated-observatory/</link><pubDate>Fri, 11 Sep 2020 16:00:39 +0000</pubDate><guid>https://danielantal.eu/hu/post/2020-09-11-creating-automated-observatory/</guid><description>&lt;p>We are building data ecosystems, so called observatories, where scientific, business, policy and civic users can find factual information, data, evidence for their domain. Our open source, open data, open collaboration approach allows to connect various open and proprietary data sources, and our reproducible research workflows allow us to automate data collection, processing, publication, documentation and presentation.&lt;/p>
&lt;p>Our scripts are checking data sources, such as Eurostat&amp;rsquo;s Eurobase, Spotify&amp;rsquo;s API and other music industry sources every day for new information, and process any data corrections or new disclosure, interpolate, backcast or forecast missing values, make currency translations and unit conversions. This is shown illustrated with an &lt;a href="https://dataobservatory.eu/post/2020-07-25-reproducible_ingestion/" target="_blank" rel="noopener">earlier post&lt;/a>.&lt;/p>
&lt;div style="position: relative; padding-bottom: 56.25%; height: 0; overflow: hidden;">
&lt;iframe src="https://www.youtube.com/embed/fQJHflWPS34" style="position: absolute; top: 0; left: 0; width: 100%; height: 100%; border:0;" allowfullscreen title="YouTube Video">&lt;/iframe>
&lt;/div>
&lt;p>For direct access to the file visit &lt;a href="https://dataobservatory.eu/video/making-of-dmo.mp4" target="_blank" rel="noopener">this link&lt;/a>.&lt;/p>
&lt;p>In the video we show automated the creation of an observatory website with well-formatted, statistical data dissemination, a technical document in PDF and an ebook can be automated. In our view, our technology is particularly useful technology in business and scientific researech projects, where it is important that always the most timely and correct data is being analyzed, and remains automatically documented and cited. We are ready deploy public, collaborative, or private data observatories in short time.&lt;/p>
&lt;p>Data processing costs can be as high as 80% for any in-house AI deployment project. We work mainly with organization that do not have in house data science team, and acquire their data anyway from outside the organization. In their case, this rate can be as high as 95%, meaning that getting and processing the data for deploying AI can be 20x more expensive than the AI solution itself.&lt;/p>
&lt;p>AI solutions require a large amount of standardized, well processed data to learn from. We want to radically decrease the cost of data acquisition and processing for our users so that exploiting AI becomes in their reach. This is particularly important in one of our target industries, the music industries, where most of the global sales is algorithmic and AI-driven. Artists, bands, small labels, publishers, even small country national associations cannot remain competitive if they cannot participate in this technological revolution.&lt;/p>
&lt;p>We &lt;a href="https://dataobservatory.eu/post/2020-08-24-start-up/" target="_blank" rel="noopener">started&lt;/a> our operations on 1 September 2020 on the basis of &lt;a href="http://documentation.ceemid.eu/" target="_blank" rel="noopener">CEEMID&lt;/a>, a pan-European data observatory that created about 2000 music and creative industry indicators for its users. In the coming days, we are gradually opening up about 50 &lt;a href="https://music.dataobservatory.eu/" target="_blank" rel="noopener">music industry&lt;/a> and 50 broader creative industry indicators in a fully reproducible workflow, with daily re-freshed, re-processed, well-formatted and documented indicators for business and policy decisions.&lt;/p>
&lt;p>We would like to validate this approach in one of the world&amp;rsquo;s most prestigious university-backed incubator programs, in the &lt;a href="https://www.yesdelft.com/yes-programs/ai-blockchain-validation-lab/" target="_blank" rel="noopener">Yes!Delft AI/Blockchain Validation Lab&lt;/a>.&lt;/p>
&lt;h2 id="video-credits">Video credits&lt;/h2>
&lt;ul>
&lt;li>Data acquisition and processing: Daniel Antal, CFA and Marta Kołczyńska, PhD (&lt;a href="https://music.dataobservatory.eu/economy.html#demand" target="_blank" rel="noopener">survey data&lt;/a>).&lt;/li>
&lt;li>Documentation automation: Sandor Budai&lt;/li>
&lt;li>Video art: Line Matson&lt;/li>
&lt;li>Music: &lt;a href="https://www.youtube.com/moonmoonmoon" target="_blank" rel="noopener">Moon Moon Moon&lt;/a>.&lt;/li>
&lt;/ul></description></item><item><title>Starting-up</title><link>https://danielantal.eu/hu/post/2020-08-24-start-up/</link><pubDate>Mon, 24 Aug 2020 10:15:00 +0000</pubDate><guid>https://danielantal.eu/hu/post/2020-08-24-start-up/</guid><description>&lt;p>The big day has come: the co-founders singed off the documents at the public notary and started the registration of a reproducible research start-up in Leiden. We got a lot of support from our friends! Your encouragement gives us a lot of energy to accomplish our first milestones, and to get Reprex B.V. going!&lt;/p>
&lt;blockquote>
&lt;p>Reprex means &amp;lsquo;reproducible example&amp;rsquo; in data science. When you are stuck with a problem, creating a reproducible example allows other computer scientists, statisticians, programmers or data users to solve it. In 80% of the cases, you usually find the solution while creating a generalized example. In the 20% other cases, you can reach out for help easily.&lt;/p>
&lt;/blockquote>
&lt;p>In the coming days, we are launching demo versions of our headline products, data observatories. &lt;a href="https://music.dataobservatory.eu/index.html" target="_blank" rel="noopener">music.dataobservatory.eu&lt;/a> will be a fully automated online service that every day collects, processes, cleans, and publishes scientifically valid data about European music. Very soon after we will launch two other observatories.&lt;/p>
&lt;p>The creative and cultural sector, NGOs, most research institutions, data journalism teams are usually very small, and they do not have internal IT or data science capacities. We would like to provide them a transparent, high quality, and fully open source solution to acquire data, process it without errors, document it and make sense of it. We would like to embrace the idea of open collaboration among creative enterprises, scientific researchers, NGOs, data journalists and policymakers with our work.&lt;/p>
&lt;p>Our work will comply with the &lt;a href="https://www.bitss.org/opa/" target="_blank" rel="noopener">Open Policy Analysis&lt;/a> standards developed by the &lt;a href="https://www.bitss.org/" target="_blank" rel="noopener">Berkeley Initiative for Transparency in the Social Sciences&lt;/a> &amp;amp; &lt;a href="https://cega.berkeley.edu/" target="_blank" rel="noopener">Center for Effective Global Action&lt;/a> and the four principles of &lt;a href="http://dataobservatory.eu/reproducible/" target="_blank" rel="noopener">reproducible research&lt;/a>: reviewability, replicability, confirmability and auditability. We believe that these standards apply in reproducible finance, empirical evidence presentation in courts, or advocating sound policies and producing high-quality journalism.&lt;/p>
&lt;h2 id="help">Do you want to help our start?&lt;/h2>
&lt;p>We would like to enter into the Validation Lab of one of the best artificial intelligence incubators in early September. Talented team members, letters of intents and assignments from organizations will give a lot of credibility to our start &lt;a href="http://dataobservatory.eu/team/" target="_blank" rel="noopener">Meet our team »&lt;/a>.&lt;/p>
&lt;ul>
&lt;li>
&lt;p>Put as in contact with people who love to write code in R and interested in automating business and social science research and primary data collection such as surveying. &lt;a href="http://dataobservatory.eu/#featured" target="_blank" rel="noopener">Check out what sort of code we create »&lt;/a>&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Introduce us to people who need data and information to make better informed decision and analysis in music, film, book publishing, photography services or socially responsible finance.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Share contacts of data journalists who would like to develop stories from big survey programs like &lt;a href="https://ec.europa.eu/commfrontoffice/publicopinion/index.cfm" target="_blank" rel="noopener">Eurobarometer&lt;/a>, &lt;a href="https://www.afrobarometer.org/" target="_blank" rel="noopener">Afrobarometer&lt;/a> and &lt;a href="https://www.latinobarometro.org/lat.jsp" target="_blank" rel="noopener">Lationbarometro&lt;/a>, or base their storytelling on data and its visualizations. &lt;a href="http://retroharmonize.satellitereport.com/" target="_blank" rel="noopener">See our survey harmonization examples »&lt;/a>&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>Do you know such people? Send over this post or connect us in an email or social media message!&lt;/p>
&lt;p>&lt;em>Thanks again for your good wishes and encouragements, and hope to hear from you soon!&lt;/em>&lt;/p></description></item></channel></rss>