<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>R | Antal Dániel honlapja</title><link>https://danielantal.eu/hu/tag/r/</link><atom:link href="https://danielantal.eu/hu/tag/r/index.xml" rel="self" type="application/rss+xml"/><description>R</description><generator>Wowchemy (https://wowchemy.com)</generator><language>hu</language><lastBuildDate>Mon, 21 Sep 2020 11:31:39 +0000</lastBuildDate><image><url>https://danielantal.eu/media/icon_hub9491570ac57158c0eeecc95c95b13e5_20247_512x512_fill_lanczos_center_3.png</url><title>R</title><link>https://danielantal.eu/hu/tag/r/</link></image><item><title>Reproducible Survey Harmonization: retroharmonize Is Released</title><link>https://danielantal.eu/hu/post/2020-09-21-retroharmonize_release/</link><pubDate>Mon, 21 Sep 2020 11:31:39 +0000</pubDate><guid>https://danielantal.eu/hu/post/2020-09-21-retroharmonize_release/</guid><description>&lt;p>Our original intention was to make surveying more accessible for music and creative industry partners, by relying more on already existing survey data, and better designing complementary, smaller surveys, becasue surveying, opinion polling is becoming increasingly expensive in the develop world. People are less and less likely to sit down for an interview in their houses. We have tried to harmonize our custom surveys, particuarly with Kantar in Hungary and Focus in Slovakia with exisiting EU projects. But we ended up making a part of international survey harmonization across countries and throughout years easier to automate.&lt;/p>
&lt;p>
&lt;figure >
&lt;div class="d-flex justify-content-center">
&lt;div class="w-100" >&lt;img src="https://danielantal.eu/img/packages/ab_plot1.png" alt="Harmonized results from Afrobarometer" loading="lazy" data-zoomable />&lt;/div>
&lt;/div>&lt;/figure>
&lt;/p>
&lt;p>Surveys are like sensors for natural sciences and industrial production. They are essential for almost any social and economic statistical indicator, for calculating the inflation, parts of the GDP, participation in education programs. Making surveys easier to harmonize and exploit more already existing survey data can bring down research cost, and can increase research value at the same time. (See our earlier blog post &lt;a href="https://dataobservatory.eu/post/2020-07-10-retroharmonize/" target="_blank" rel="noopener">Increase The Value Of Market Research With Open Data And Survey Harmonization&lt;/a>.)&lt;/p>
&lt;p>So, if you are an R user, you can use &lt;code>install.packages(“retroharmonize”)&lt;/code> to get the released 0.1.13 version and make tutorials with real Eurobarometer or Afrobarometer microdata. With &lt;code>devtools::install_github(&amp;quot;antaldaniel/retroharmonize&amp;quot;)&lt;/code> you can already install the current development version 0.1.14, which handles perl-like regex, which will be necessary for our next tutorial in the making for &lt;a href="https://www.arabbarometer.org/" target="_blank" rel="noopener">Arab Barometer&lt;/a>.&lt;/p>
&lt;p>&lt;strong>Related&lt;/strong>:&lt;/p>
&lt;ul>
&lt;li>
&lt;p>&lt;a href="https://retroharmonize.dataobservatory.eu/" target="_blank" rel="noopener">retroharmonize package website&lt;/a>&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;a href="https://github.com/antaldaniel/retroharmonize/" target="_blank" rel="noopener">retroharmonize on github&lt;/a>&lt;/p>
&lt;/li>
&lt;/ul></description></item><item><title>Launching Our Demo Music Observatory</title><link>https://danielantal.eu/hu/post/2020-09-15-music-observatory-launch/</link><pubDate>Tue, 15 Sep 2020 08:00:39 +0000</pubDate><guid>https://danielantal.eu/hu/post/2020-09-15-music-observatory-launch/</guid><description>&lt;p>Today, on 15 September 2020, we officially launched our &lt;code>minimal viable product&lt;/code> as we promised to partners back in February. This was a particularly difficult period for everybody. We aspired to deliver by September in a very different environment, our hopes for commissioned work went up in flames with the pandemic, and our targeted users, musicians and music entrepreneurs, talent managers, music venues lost most of their income. The organizations helping them, granting authorities, export offices and collective management societies are overwhelmed with the problem. During these troublesome times, our team expanded, attracted great new talent, and kept working.&lt;/p>
&lt;p>Our first product is the &lt;a href="https://music.dataobservatory.eu/" target="_blank" rel="noopener">Demo Music Observatory&lt;/a>, a collaborative, automated research-based &lt;a href="https://dataobservatory.eu/faq/observatories/" target="_blank" rel="noopener">observatory&lt;/a> for the music industry, one that is particularly hard hit by the COVID19 crisis. Not only great artists, composers, technicians, managers fell victim to the virus, but musicians lost about 50–90% of their income from live music. This translates to a 100% loss for the live music technicians and managers.&lt;/p>
&lt;p>
&lt;div style="position: relative; padding-bottom: 56.25%; height: 0; overflow: hidden;">
&lt;iframe src="https://www.youtube.com/embed/fQJHflWPS34" style="position: absolute; top: 0; left: 0; width: 100%; height: 100%; border:0;" allowfullscreen title="YouTube Video">&lt;/iframe>
&lt;/div>
See our &lt;a href="https://dataobservatory.eu/post/2020-09-11-creating-automated-observatory/" target="_blank" rel="noopener">earlier blogpost&lt;/a> on what you see on the video.&lt;/p>
&lt;p>The music industry was never a place for great job security. For putting up a show, you usually need a network of 10–200 artists, technicians and managers to work together as freelancers without all those social benefits that many people enjoy in other walks of life. We have been trying to figure out how to help this microenterprise and freelancer-network based industry with research for five years. Our aim is to make them competitive when they are talking with their buyers: Google, Apple, Spotify, who are really heavy-weight data and AI pros. Our better plan their tours, when they will be back on the road, to understand what sort of audiences and purchasing power waits for them in different European cities.&lt;/p>
&lt;p>We are launching at a time when the music industry is crying for help.Therefore, we have decided to make our demo observatory open and unfinished. Over the last 7 years, we have built up about 2000 music and creative sector indicators to be used for business KPIs, forecasting targets, grant evaluations, royalty valuations, concert demography target group analysis and other professional uses. We would like to open up, based on your needs, about 50 well-designed indicators, and pledge to keep it daily refreshed, corrected, documented, citaable, downloadable. Also, feel free to use our most valuable source code—use it for your own purposes, even modify it, as long as you keep it open.&lt;/p>
&lt;p>For our smaller partners, we follow what musicians do these days on Bandcamp: name your price. We make a pledge to our small partners: if you need reliable data to plan your next grant calls, calculate royalties, compensations, predict hit candidates, give us the job—and name your price. Post-corona, you can take for a dollar the best music from Bandcamp. You can take our research products, for a limited period, for any amount you name, as long as it is for a good cause and serves the industry, musicians, technicians or managers. In return, we ask for your feedback. Help us validate whether we are on the right track, tell us how we can cooperate after the pandemic, in better times.&lt;/p>
&lt;p>Our larger and better funded partners? We ask you to pay the price we name, because we believe that it is a well-justified, fair and competitive price, set by pricing experts.&lt;/p>
&lt;p>We appreciate it if you take a look at our offering, or if you pass this blogpost on to your colleagues in the industry. Our main target audience initially are music professional in broader Europe, but we are planning to cover all major global markets very soon, too. Feedback from the U.S., Australia, Canada, Colombia, Brazil &amp;amp; Argentina is particularly welcome as we have great plans over there!&lt;/p>
&lt;h2 id="who-we-are">Who we are?&lt;/h2>
&lt;p>We &lt;a href="https://dataobservatory.eu/post/2020-08-24-start-up/" target="_blank" rel="noopener">started&lt;/a> our operations on 1 September 2020 on the basis of &lt;a href="http://documentation.ceemid.eu/" target="_blank" rel="noopener">CEEMID&lt;/a>, a pan-European data observatory that created about 2000 music and creative industry indicators for its users. In the coming days, we are gradually opening up about 50 &lt;a href="https://music.dataobservatory.eu/" target="_blank" rel="noopener">music industry&lt;/a> and 50 broader creative industry indicators in a fully reproducible workflow, with daily re-freshed, re-processed, well-formatted and documented indicators for business and policy decisions.&lt;/p>
&lt;p>We would like to validate this approach in one of the world&amp;rsquo;s most prestigious university-backed incubator programs, in the &lt;a href="https://www.yesdelft.com/yes-programs/ai-blockchain-validation-lab/" target="_blank" rel="noopener">Yes!Delft AI/Blockchain Validation Lab&lt;/a>. We&amp;rsquo;re finalist on their selection, and all help before 23 September from our friends in the music industry is more than appreciated. If we get there, we can rely on probably the best pros in Europe to make our offering better tailored and financially sustainable.&lt;/p>
&lt;h2 id="get-in-touch">Get in touch!&lt;/h2>
&lt;p>We use the very simple and extremely secure &lt;strong>keybase.io&lt;/strong>, a kind of mix of Whatsapp, Skype, Google Drive, One Drive and zoom. You can get in touch on that platform with us in anytime &lt;a href="https://keybase.io/team/reprexcommunity" target="_blank" rel="noopener">here&lt;/a>.&lt;/p>
&lt;p>You can easily contact on LinkedIn &lt;a href="https://www.linkedin.com/in/antaldaniel/" target="_blank" rel="noopener">Daniel&lt;/a> or &lt;a href="https://www.linkedin.com/in/k%C3%A1tya-nagy-a9447730/" target="_blank" rel="noopener">Kátya&lt;/a> and of course, we have a usually working &lt;a href="https://dataobservatory.eu/#about" target="_blank" rel="noopener">email contact form&lt;/a>, too. Our email is name.surname at our main domain.&lt;/p>
&lt;h2 id="video-credits">Video credits&lt;/h2>
&lt;ul>
&lt;li>Data acquisition and processing: Daniel Antal, CFA and Marta Kołczyńska, PhD (&lt;a href="https://music.dataobservatory.eu/economy.html#demand" target="_blank" rel="noopener">survey data&lt;/a>).&lt;/li>
&lt;li>Documentation automation: Sandor Budai&lt;/li>
&lt;li>Video art: Line Matson&lt;/li>
&lt;li>Music: &lt;a href="https://www.youtube.com/moonmoonmoon" target="_blank" rel="noopener">Moon Moon Moon&lt;/a>.&lt;/li>
&lt;/ul></description></item><item><title>Creating An Automated Data Observatory</title><link>https://danielantal.eu/hu/post/2020-09-11-creating-automated-observatory/</link><pubDate>Fri, 11 Sep 2020 16:00:39 +0000</pubDate><guid>https://danielantal.eu/hu/post/2020-09-11-creating-automated-observatory/</guid><description>&lt;p>We are building data ecosystems, so called observatories, where scientific, business, policy and civic users can find factual information, data, evidence for their domain. Our open source, open data, open collaboration approach allows to connect various open and proprietary data sources, and our reproducible research workflows allow us to automate data collection, processing, publication, documentation and presentation.&lt;/p>
&lt;p>Our scripts are checking data sources, such as Eurostat&amp;rsquo;s Eurobase, Spotify&amp;rsquo;s API and other music industry sources every day for new information, and process any data corrections or new disclosure, interpolate, backcast or forecast missing values, make currency translations and unit conversions. This is shown illustrated with an &lt;a href="https://dataobservatory.eu/post/2020-07-25-reproducible_ingestion/" target="_blank" rel="noopener">earlier post&lt;/a>.&lt;/p>
&lt;div style="position: relative; padding-bottom: 56.25%; height: 0; overflow: hidden;">
&lt;iframe src="https://www.youtube.com/embed/fQJHflWPS34" style="position: absolute; top: 0; left: 0; width: 100%; height: 100%; border:0;" allowfullscreen title="YouTube Video">&lt;/iframe>
&lt;/div>
&lt;p>For direct access to the file visit &lt;a href="https://dataobservatory.eu/video/making-of-dmo.mp4" target="_blank" rel="noopener">this link&lt;/a>.&lt;/p>
&lt;p>In the video we show automated the creation of an observatory website with well-formatted, statistical data dissemination, a technical document in PDF and an ebook can be automated. In our view, our technology is particularly useful technology in business and scientific researech projects, where it is important that always the most timely and correct data is being analyzed, and remains automatically documented and cited. We are ready deploy public, collaborative, or private data observatories in short time.&lt;/p>
&lt;p>Data processing costs can be as high as 80% for any in-house AI deployment project. We work mainly with organization that do not have in house data science team, and acquire their data anyway from outside the organization. In their case, this rate can be as high as 95%, meaning that getting and processing the data for deploying AI can be 20x more expensive than the AI solution itself.&lt;/p>
&lt;p>AI solutions require a large amount of standardized, well processed data to learn from. We want to radically decrease the cost of data acquisition and processing for our users so that exploiting AI becomes in their reach. This is particularly important in one of our target industries, the music industries, where most of the global sales is algorithmic and AI-driven. Artists, bands, small labels, publishers, even small country national associations cannot remain competitive if they cannot participate in this technological revolution.&lt;/p>
&lt;p>We &lt;a href="https://dataobservatory.eu/post/2020-08-24-start-up/" target="_blank" rel="noopener">started&lt;/a> our operations on 1 September 2020 on the basis of &lt;a href="http://documentation.ceemid.eu/" target="_blank" rel="noopener">CEEMID&lt;/a>, a pan-European data observatory that created about 2000 music and creative industry indicators for its users. In the coming days, we are gradually opening up about 50 &lt;a href="https://music.dataobservatory.eu/" target="_blank" rel="noopener">music industry&lt;/a> and 50 broader creative industry indicators in a fully reproducible workflow, with daily re-freshed, re-processed, well-formatted and documented indicators for business and policy decisions.&lt;/p>
&lt;p>We would like to validate this approach in one of the world&amp;rsquo;s most prestigious university-backed incubator programs, in the &lt;a href="https://www.yesdelft.com/yes-programs/ai-blockchain-validation-lab/" target="_blank" rel="noopener">Yes!Delft AI/Blockchain Validation Lab&lt;/a>.&lt;/p>
&lt;h2 id="video-credits">Video credits&lt;/h2>
&lt;ul>
&lt;li>Data acquisition and processing: Daniel Antal, CFA and Marta Kołczyńska, PhD (&lt;a href="https://music.dataobservatory.eu/economy.html#demand" target="_blank" rel="noopener">survey data&lt;/a>).&lt;/li>
&lt;li>Documentation automation: Sandor Budai&lt;/li>
&lt;li>Video art: Line Matson&lt;/li>
&lt;li>Music: &lt;a href="https://www.youtube.com/moonmoonmoon" target="_blank" rel="noopener">Moon Moon Moon&lt;/a>.&lt;/li>
&lt;/ul></description></item><item><title>Starting-up</title><link>https://danielantal.eu/hu/post/2020-08-24-start-up/</link><pubDate>Mon, 24 Aug 2020 10:15:00 +0000</pubDate><guid>https://danielantal.eu/hu/post/2020-08-24-start-up/</guid><description>&lt;p>The big day has come: the co-founders singed off the documents at the public notary and started the registration of a reproducible research start-up in Leiden. We got a lot of support from our friends! Your encouragement gives us a lot of energy to accomplish our first milestones, and to get Reprex B.V. going!&lt;/p>
&lt;blockquote>
&lt;p>Reprex means &amp;lsquo;reproducible example&amp;rsquo; in data science. When you are stuck with a problem, creating a reproducible example allows other computer scientists, statisticians, programmers or data users to solve it. In 80% of the cases, you usually find the solution while creating a generalized example. In the 20% other cases, you can reach out for help easily.&lt;/p>
&lt;/blockquote>
&lt;p>In the coming days, we are launching demo versions of our headline products, data observatories. &lt;a href="https://music.dataobservatory.eu/index.html" target="_blank" rel="noopener">music.dataobservatory.eu&lt;/a> will be a fully automated online service that every day collects, processes, cleans, and publishes scientifically valid data about European music. Very soon after we will launch two other observatories.&lt;/p>
&lt;p>The creative and cultural sector, NGOs, most research institutions, data journalism teams are usually very small, and they do not have internal IT or data science capacities. We would like to provide them a transparent, high quality, and fully open source solution to acquire data, process it without errors, document it and make sense of it. We would like to embrace the idea of open collaboration among creative enterprises, scientific researchers, NGOs, data journalists and policymakers with our work.&lt;/p>
&lt;p>Our work will comply with the &lt;a href="https://www.bitss.org/opa/" target="_blank" rel="noopener">Open Policy Analysis&lt;/a> standards developed by the &lt;a href="https://www.bitss.org/" target="_blank" rel="noopener">Berkeley Initiative for Transparency in the Social Sciences&lt;/a> &amp;amp; &lt;a href="https://cega.berkeley.edu/" target="_blank" rel="noopener">Center for Effective Global Action&lt;/a> and the four principles of &lt;a href="http://dataobservatory.eu/reproducible/" target="_blank" rel="noopener">reproducible research&lt;/a>: reviewability, replicability, confirmability and auditability. We believe that these standards apply in reproducible finance, empirical evidence presentation in courts, or advocating sound policies and producing high-quality journalism.&lt;/p>
&lt;h2 id="help">Do you want to help our start?&lt;/h2>
&lt;p>We would like to enter into the Validation Lab of one of the best artificial intelligence incubators in early September. Talented team members, letters of intents and assignments from organizations will give a lot of credibility to our start &lt;a href="http://dataobservatory.eu/team/" target="_blank" rel="noopener">Meet our team »&lt;/a>.&lt;/p>
&lt;ul>
&lt;li>
&lt;p>Put as in contact with people who love to write code in R and interested in automating business and social science research and primary data collection such as surveying. &lt;a href="http://dataobservatory.eu/#featured" target="_blank" rel="noopener">Check out what sort of code we create »&lt;/a>&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Introduce us to people who need data and information to make better informed decision and analysis in music, film, book publishing, photography services or socially responsible finance.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Share contacts of data journalists who would like to develop stories from big survey programs like &lt;a href="https://ec.europa.eu/commfrontoffice/publicopinion/index.cfm" target="_blank" rel="noopener">Eurobarometer&lt;/a>, &lt;a href="https://www.afrobarometer.org/" target="_blank" rel="noopener">Afrobarometer&lt;/a> and &lt;a href="https://www.latinobarometro.org/lat.jsp" target="_blank" rel="noopener">Lationbarometro&lt;/a>, or base their storytelling on data and its visualizations. &lt;a href="http://retroharmonize.satellitereport.com/" target="_blank" rel="noopener">See our survey harmonization examples »&lt;/a>&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>Do you know such people? Send over this post or connect us in an email or social media message!&lt;/p>
&lt;p>&lt;em>Thanks again for your good wishes and encouragements, and hope to hear from you soon!&lt;/em>&lt;/p></description></item></channel></rss>