Data Profiling: Efficient Discovery of Structural Dependencies, Dr. Thorsten Papenbrock, BTW 2019


Data Profiling
Efficient Discovery of Structural Dependencies
                                                      Dr. Thorsten Papenbrock
                                                     Information Systems Group, HPI

Knowledge Discovery

What data do you have?

                                        Slide 2

Knowledge Discovery

Many companies do not know what data they have!
   Decentralized storage and retrieval
   Heterogeneous data formats and systems
   Unconnected sources
   Lack of metadata and integrity constraints
   Different access rights
   Data quality issues
   Complicated business processes
   Data backups and archives
   Data acquisition and sharing
   …                                                            Slide 3

CrowdFlower Data Science Report 2016: ~80% on data preparation!
https://visit.figure-eight.com/rs/416-ZBE-142/images/CrowdFlower_DataScienceReport_2016.pdf

Layers: Knowledge Discovery, Data Analytics

Data scientists spend most of their time on data preparation!
   Multiple, heterogeneous data sources
   Lack of metadata and documentation
   Data quality issues
   Data acquisition and sharing
   …                                                                                                                                  Slide 4

Deep Visual-Semantic Alignments for Generating Image Descriptions
Andrej Karpathy and Li Fei-Fei, Stanford University, TPAMI, 2015

Layers: Knowledge Discovery, Data Analytics, AI Systems

AI systems learn what they see and understand
AI systems learn erroneous, non-interpretable behavior!

   Data quality issues
   Insufficient training data
   Heterogeneous data formats and systems
   Lack of metadata and documentation
   …                                                                                             Slide 5

Data Engineering for Data Science

Application: Knowledge Discovery, Data Analytics, AI Systems, …

                                                        Slide 6

Data Engineering for Data Science

Data → Preparation → Application
   Preparation: Schema Engineering, Data Exploration, Data Cleaning, Scientific Data Management, Data Integration
   Application: Knowledge Discovery, Data Analytics, AI Systems, …

                                                                                      Slide 7

Data Engineering for Data Science

Data Profiling: “The activity of collecting data about data” (statistics, dependencies, and layouts)

Data → Data Profiling → Preparation → Application
   Metadata produced by Data Profiling: IND, MD, FD, UCC, MVD, OD
   Preparation: Schema Engineering, Data Exploration, Data Cleaning, Scientific Data Management, Data Integration
   Application: Knowledge Discovery, Data Analytics, AI Systems, …

                                                                                                          Slide 8

Data Profiling

  ID        Name          Evolution      Location        Sex    Weight   Size   Type       Weak       Strong     Special
  25       Pikachu          Raichu     Viridian Forest   m/w     6.0     0.4    electric   ground     water       false
  27      Sandshrew       Sandslash       Route 4        m/w     12.0    0.6    ground      grass     electric    false
  29       Nidoran         Nidorino     Safari Zone      m       9.0     0.5    poison     ground      grass      false
  32       Nidoran         Nidorina     Safari Zone       w      7.0     0.4    poison     ground      grass      false
  37        Vulpix         Ninetails      Route 7        m/w     9.9     0.6      fire     water        ice       false
  38       Ninetails          null          null         m/w     19.9    1.1      fire     water        ice       true
  63         Abra          Kadabra       Route 24        m/w     19.5    0.9    psychic    ghost      fighting    false
  64       Kadabra        Alakazam     Cerulean Cave     m/w     56.5    1.3    psychic    ghost      fighting    false
 130      Gyarados            null      Fuchsia City     m/w    235.0    6.5    water      electric     fire      false
 150       Mewtwo             null     Cerulean Cave     null   122.0    2.0    psychic    ghost      fighting    true

 http://bulbapedia.bulbagarden.net
                                                                                                                 Slide 9

Data Profiling

Single-column statistics for the Pokémon table (Slide 9):
   size: # = 10 (rows)
   density (Evolution column): #null = 3, %null = 30
   ranges (Size column): min = 0.4, max = 6.5
   aggregations (Size column): sum = 14.3, avg = 1.43
   distributions (value histograms)
   format
   data types:
      ID          INTEGER
      Name        CHAR(16)
      Evolution   CHAR(16)
      Location    CHAR(32)
      Sex         CHAR(3)
      Weight      FLOAT
      Size        FLOAT
      Type        CHAR(8)
      Weak        CHAR(8)
      Strong      CHAR(8)
      Special     BOOLEAN
                                                                                                                 Slide 10
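As a minimal sketch of how single-column statistics of this kind can be computed, the following Python snippet profiles the Size and Evolution columns of the Pokémon table. The function name `profile_column` and the choice of statistics are illustrative assumptions, not part of the talk or of any profiling tool.

```python
import math

# Size and Evolution columns of the Pokémon table (None marks a null value).
SIZE = [0.4, 0.6, 0.5, 0.4, 0.6, 1.1, 0.9, 1.3, 6.5, 2.0]
EVOLUTION = ["Raichu", "Sandslash", "Nidorino", "Nidorina", "Ninetails",
             None, "Kadabra", "Alakazam", None, None]


def profile_column(values):
    """Return basic statistics for one column (hypothetical helper).

    Numeric statistics are added only if every non-null value is a number.
    """
    non_null = [v for v in values if v is not None]
    null_count = len(values) - len(non_null)
    stats = {
        "size": len(values),
        "null_count": null_count,
        "null_percent": 100.0 * null_count / len(values) if values else 0.0,
        "distinct": len(set(non_null)),
    }
    if non_null and all(isinstance(v, (int, float)) for v in non_null):
        total = math.fsum(non_null)  # fsum limits floating-point error
        stats.update(
            min=min(non_null),
            max=max(non_null),
            sum=round(total, 2),
            avg=round(total / len(non_null), 2),
        )
    return stats


size_stats = profile_column(SIZE)
evolution_stats = profile_column(EVOLUTION)
print(size_stats)
print(evolution_stats)
```

Each statistic needs only a single pass over one column, which is why this kind of profiling is cheap compared with dependency discovery across column combinations.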

Data Profiling

Multi-column dependencies in the Pokémon table (Slide 9):
   inclusion dependencies: Pokemon.Location ⊆ Location.Name
   functional dependencies: Type → Weak
   unique column combinations: {Name, Sex}
   order dependencies: Weight ↓ Size
   denial constraints: Weak ≠ Strong
                                                                                                                 Slide 11
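The following Python sketch checks three of these dependency types against the Pokémon table. The helper names are hypothetical, and the order dependency is read here as "sorting by Weight also sorts by Size", which is an assumption about the slide's notation. The inclusion dependency is left out because the referenced Location table is not part of the transcript.

```python
COLUMNS = ("ID", "Name", "Sex", "Weight", "Size", "Weak", "Strong")
ROWS = [
    (25, "Pikachu", "m/w", 6.0, 0.4, "ground", "water"),
    (27, "Sandshrew", "m/w", 12.0, 0.6, "grass", "electric"),
    (29, "Nidoran", "m", 9.0, 0.5, "ground", "grass"),
    (32, "Nidoran", "w", 7.0, 0.4, "ground", "grass"),
    (37, "Vulpix", "m/w", 9.9, 0.6, "water", "ice"),
    (38, "Ninetails", "m/w", 19.9, 1.1, "water", "ice"),
    (63, "Abra", "m/w", 19.5, 0.9, "ghost", "fighting"),
    (64, "Kadabra", "m/w", 56.5, 1.3, "ghost", "fighting"),
    (130, "Gyarados", "m/w", 235.0, 6.5, "electric", "fire"),
    (150, "Mewtwo", None, 122.0, 2.0, "ghost", "fighting"),
]


def is_unique(rows, columns, combination):
    """UCC check: no two rows share the same values in `combination`."""
    idx = [columns.index(c) for c in combination]
    seen = set()
    for row in rows:
        key = tuple(row[i] for i in idx)
        if key in seen:
            return False
        seen.add(key)
    return True


def orders(rows, columns, lhs, rhs):
    """OD check: for all rows s, t, s[lhs] <= t[lhs] implies s[rhs] <= t[rhs]."""
    i, j = columns.index(lhs), columns.index(rhs)
    return not any(s[i] <= t[i] and s[j] > t[j] for s in rows for t in rows)


def never_equal(rows, columns, a, b):
    """Denial-constraint check: no row has the same value in columns a and b."""
    i, j = columns.index(a), columns.index(b)
    return all(row[i] != row[j] for row in rows)


print(is_unique(ROWS, COLUMNS, ("Name", "Sex")))
print(orders(ROWS, COLUMNS, "Weight", "Size"))
print(never_equal(ROWS, COLUMNS, "Weak", "Strong"))
```

Checking one candidate is simple. The hard part of data profiling is that the number of candidates grows exponentially with the number of attributes.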

Data Profiling

Functional dependency in the Pokémon table (Slide 9): Type → Weak

                                                                                                          Slide 12

Data Profiling

~8 million records, 94 attributes

2013       2014   2015           2016          2017             2018

Functional Dependencies

Definition: Given a relational instance r for a schema R, the functional dependency X → Y
with X ⊆ R and Y ⊆ R is valid in r iff
∀ t_i, t_j ∈ r : t_i[X] = t_j[X] ⇒ t_i[Y] = t_j[Y].

“The values in X functionally define the values in Y”

(Figure: X determines Y1 and Y2)

Examples: Type → Weak, Strong; GYM → Leader, Reward
                                                                                     Slide 14
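The definition translates almost directly into a validity check: group the records by their X values and verify that each group has a single Y value. The Python sketch below does this for the Pokémon table; the function name `fd_holds` is an illustrative assumption.

```python
COLUMNS = ("Name", "Evolution", "Type", "Weak", "Strong")
ROWS = [
    ("Pikachu", "Raichu", "electric", "ground", "water"),
    ("Sandshrew", "Sandslash", "ground", "grass", "electric"),
    ("Nidoran", "Nidorino", "poison", "ground", "grass"),
    ("Nidoran", "Nidorina", "poison", "ground", "grass"),
    ("Vulpix", "Ninetails", "fire", "water", "ice"),
    ("Ninetails", None, "fire", "water", "ice"),
    ("Abra", "Kadabra", "psychic", "ghost", "fighting"),
    ("Kadabra", "Alakazam", "psychic", "ghost", "fighting"),
    ("Gyarados", None, "water", "electric", "fire"),
    ("Mewtwo", None, "psychic", "ghost", "fighting"),
]


def fd_holds(rows, columns, lhs, rhs):
    """Return True iff the FD lhs -> rhs is valid in rows.

    Two records that agree on all lhs attributes must also agree
    on all rhs attributes.
    """
    lhs_idx = [columns.index(c) for c in lhs]
    rhs_idx = [columns.index(c) for c in rhs]
    first_seen = {}  # lhs value combination -> rhs value combination
    for row in rows:
        key = tuple(row[i] for i in lhs_idx)
        value = tuple(row[i] for i in rhs_idx)
        if first_seen.setdefault(key, value) != value:
            return False  # same lhs values, different rhs values
    return True


print(fd_holds(ROWS, COLUMNS, ["Type"], ["Weak", "Strong"]))
print(fd_holds(ROWS, COLUMNS, ["Name"], ["Evolution"]))
```

The second check fails because the two Nidoran records share a Name but have different Evolution values.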



[TANE] TANE: An efficient algorithm for discovering functional and approximate dependencies,
Ykä Huhtala, Juha Kärkkäinen, Pasi Porkka and Hannu Toivonen, The Computer Journal, 1999.

[FUN] FUN: An efficient algorithm for mining functional and embedded dependencies,
Noël Novelli and Rosine Cicchetti, ICDT, 2001.

[FD_MINE] FD Mine: discovering functional dependencies in a database using equivalences,
Hong Yao, Howard J Hamilton and Cory J Butz, ICDM, 2002.

[DFD] DFD: Efficient Functional Dependency Discovery,
Ziawasch Abedjan, Patrick Schulze and Felix Naumann, CIKM, 2014.

[DEP-MINER] Efficient discovery of functional dependencies and Armstrong relations,
Stéphane Lopes, Jean-Marc Petit and Lotfi Lakhal, EDBT, 2000.

[FASTFDS] FastFDs: A heuristic-driven, depth-first algorithm for mining functional dependencies from relation instances,
Catharine Wyss, Chris Giannella and Edward Robertson, DaWaK, 2001.

[FDEP] Database dependency discovery: a machine learning approach,
Peter A Flach and Iztok Savnik, AI Communications, 1999.
                                                                                                                   Slide 16


[Functional Dependency Discovery: An Experimental Evaluation of Seven Algorithms, T. Papenbrock et al., VLDB, 2015]
                                                                                                                   Slide 18

Inclusion Dependencies

(Figure: attribute sets X and Y)
                                               Slide 19

HyFD (architecture diagram)

   Main components: Data Preprocessor, Record Pair Sampler, FD Candidate Inductor, FD Validator
   Optional component: Memory Guardian
   Data flow: dataset → records → Data Preprocessor → plis, pliRecords → Record Pair Sampler → non-FDs → FD Candidate Inductor → candidate-FDs → FD Validator → FDs (results)
   Feedback: FD Validator → comparisonSuggestions → Record Pair Sampler

[A Hybrid Approach to Functional Dependency Discovery, T. Papenbrock, F. Naumann, SIGMOD, 2016]              Slide 20

Example relation:

Name         Surname   Postcode    City        Mayor
Thomas       Miller    14482       Potsdam     Jakobs
Sarah        Miller    14482       Potsdam     Jakobs
Peter        Smith     60329       Frankfurt   Feldmann
Jasmine      Cone      01069       Dresden     Orosz
Thomas       Cone      14482       Potsdam     Jakobs
Mike         Moore     60329       Frankfurt   Feldmann

Attribute lattice (N = Name, S = Surname, P = Postcode, C = City, M = Mayor):

NSPCM
NSPM  NSPC  NSCM  NPCM  SPCM
NSP  NSM  NSC  NPC  NCM  NPM  SPC  SPM  SCM  PCM
NS  NP  NC  NM  SP  SC  SM  PC  PM  CM
N  S  P  C  M

Non-FDs found by comparing record pairs:
         Surname, Postcode, City, Mayor ↛ Name
         Name, Postcode, City, Mayor ↛ Surname
         Surname ↛ Name, Postcode, City, Mayor

FDs:
         Postcode → City
         Postcode → Mayor
         …

[A Hybrid Approach to Functional Dependency Discovery, T. Papenbrock, F. Naumann, SIGMOD, 2016]                  Slide 21
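The step from record-pair comparisons to FDs can be sketched in Python as follows. This is a deliberately simplified illustration, not the HyFD algorithm itself: it compares all record pairs instead of sampling, skips the validation phase, and enumerates the lattice naively. The names `agree_sets` and `minimal_fds` are assumptions made for this example.

```python
from itertools import combinations

ATTRS = ("Name", "Surname", "Postcode", "City", "Mayor")
ROWS = [
    ("Thomas", "Miller", "14482", "Potsdam", "Jakobs"),
    ("Sarah", "Miller", "14482", "Potsdam", "Jakobs"),
    ("Peter", "Smith", "60329", "Frankfurt", "Feldmann"),
    ("Jasmine", "Cone", "01069", "Dresden", "Orosz"),
    ("Thomas", "Cone", "14482", "Potsdam", "Jakobs"),
    ("Mike", "Moore", "60329", "Frankfurt", "Feldmann"),
]


def agree_sets(rows):
    """Collect, for every record pair, the attribute indexes on which it agrees.

    An agree set S encodes the non-FDs S -/-> A for every attribute A not in S.
    """
    return {
        frozenset(i for i in range(len(r1)) if r1[i] == r2[i])
        for r1, r2 in combinations(rows, 2)
    }


def minimal_fds(rows, attrs):
    """Return all minimal FDs (lhs, rhs) that no record pair violates."""
    agrees = agree_sets(rows)
    n = len(attrs)
    fds = []
    for rhs in range(n):
        others = [i for i in range(n) if i != rhs]
        minimal = []  # minimal left-hand sides found so far for rhs
        for size in range(n):  # bottom-up through the lattice levels
            for lhs in map(frozenset, combinations(others, size)):
                if any(m <= lhs for m in minimal):
                    continue  # a subset already determines rhs
                # Violated if some pair agrees on all of lhs but not on rhs.
                if not any(lhs <= ag and rhs not in ag for ag in agrees):
                    minimal.append(lhs)
        fds.extend((tuple(attrs[i] for i in sorted(m)), attrs[rhs]) for m in minimal)
    return fds


fds = minimal_fds(ROWS, ATTRS)
for lhs, rhs in fds:
    print(", ".join(lhs), "->", rhs)
```

Comparing all pairs is quadratic in the number of records, which is why sampling a subset of pairs and validating the induced candidates afterwards becomes attractive on large datasets.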


Dependency                          Algorithms (exact)   Algorithms (approximate)
Unique Column Combination (UCC)              2                      0
Inclusion Dependency (IND)                   5                      2
Functional Dependency (FD)                   8                      2
Order Dependency (OD)                        1                      0
Matching Dependency (MD)                     2                      0
Multi-valued Dependency (MVD)                1                      0
Denial Constraints (DC)                      1                      0
Statistics                                   1                     13
Total                                       21                     17

                                                                              Slide 25


[Data Profiling with Metanome, T. Papenbrock, T. Bergmann, M. Finke, J. Zwiener, F. Naumann, VLDB, 2015]   Slide 26

Metanome (architecture diagram)

   Algorithm execution; Result & Resource management
   Algorithm configuration; Result & Resource presentation
   Stored items: Configuration, Resource Links, Results
   Data sources: DB2, MySQL, txt, tsv, csv, xml
   Algorithms (jar): HyFD, DUCC, SPIDER, DFD, BINDER

[Data Profiling with Metanome, T. Papenbrock, T. Bergmann, M. Finke, J. Zwiener, F. Naumann, VLDB, 2015]   Slide 27

Metanome Use
               Slide 28

Metanome

Algorithm Research
■ Anja Jentzsch          (RDF)
■ Arvid Heise            (UCC)
■ Fabian Tschirschnitz   (IND)
■ Felix Naumann          (Research lead)
■ Hazar Harmouch         (Single Column Profiling)
■ Jens Ehrlich           (Conditional UCC)
■ Jorge-Arnulfo          (UCC, IND)
■ Maximilian Grundke     (Conditional FD)
■ Moritz Finke           (Approximate FD/IND)
■ Philipp Langer         (OD)
■ Philipp Schirmer       (MVD)
■ Sebastian Kruse        (IND, partial FD; Metadata Store)
■ Thorsten Papenbrock    (IND, UCC, FD, …; Metanome)
■ Tim Draeger            (MVD)
■ Tobias Bleifuß         (DC)
■ Ziawasch Abedjan       (UCC)

Tool Development
■ Jakob Zwiener          (Backend & Frontend)
■ Claudia Exeler         (Frontend)
■ Tanja Bergmann         (Backend & Frontend)
■ Moritz Finke           (Backend)
■ Carl Ambroselli        (Frontend)
■ Maxi Fischer           (Backend & Frontend)
■ Vincent Schwarzer      (Backend)
                                                                                                            Slide 29

2019   DynFD: Functional Dependency Discovery in Dynamic Datasets
       P. Schirmer, T. Papenbrock, S. Kruse, D. Hempfing, T. Mayer, D. Neuschäfer-Rube, F. Naumann          (EDBT)
       An Actor Database System for Akka
       S. Schmidl, F. Schneider, T. Papenbrock                                                              (BTW)
2018   Data Profiling – Synthesis Lectures on Data Management
       Z. Abedjan, L. Golab, F. Naumann, T. Papenbrock                                                      (Morgan & Claypool)
2017   Detecting Inclusion Dependencies on Very Many Tables
       F. Tschirschnitz, T. Papenbrock, F. Naumann                                                          (TODS)
       Data-driven Schema Normalization
       T. Papenbrock, F. Naumann                                                                            (EDBT)
       A Hybrid Approach for Efficient Unique Column Combination Discovery
       T. Papenbrock, F. Naumann                                                                            (BTW)
       Fast Approximate Discovery of Inclusion Dependencies
       S. Kruse, T. Papenbrock, C. Dullweber, M. Finke, M. Hegner, M. Zabel, C. Zöllner, F. Naumann         (BTW)
2016   A Hybrid Approach to Functional Dependency Discovery
       T. Papenbrock, F. Naumann                                                                            (SIGMOD)
       Data Anamnesis: Admitting Raw Data into an Organization
       S. Kruse, T. Papenbrock, H. Harmouch, F. Naumann                                                     (IEEE Data Engineering Bulletin)
       Holistic Data Profiling: Simultaneous Discovery of Various Metadata
       J. Ehrlich, M. Roick, L. Schulze, J. Zwiener, T. Papenbrock, F. Naumann                              (EDBT)
       RDFind: Scalable Conditional Inclusion Dependency Discovery in RDF Datasets
       S. Kruse, A. Jentzsch, T. Papenbrock, Z. Kaoudi, J. Quiané-Ruiz, F. Naumann                          (SIGMOD)
       Approximate Discovery of Functional Dependencies for Large Datasets
       T. Bleifuß, S. Bülow, J. Frohnhofen, J. Risch, G. Wiese, S. Kruse, T. Papenbrock, F. Naumann         (CIKM)
2015   Functional Dependency Discovery: An Experimental Evaluation of Seven Algorithms
       T. Papenbrock, J. Ehrlich, J. Marten, T. Neubert, J. Rudolph, M. Schönberg, J. Zwiener, F. Naumann   (VLDB)
       Data Profiling with Metanome
       T. Papenbrock, T. Bergmann, M. Finke, J. Zwiener, F. Naumann                                         (VLDB)
       Divide & Conquer-based Inclusion Dependency Discovery
       T. Papenbrock, S. Kruse, J. Quiané-Ruiz, F. Naumann                                                  (VLDB)
       Scaling Out the Discovery of Inclusion Dependencies
       S. Kruse, T. Papenbrock, F. Naumann                                                                  (BTW)
       Progressive Duplicate Detection
       T. Papenbrock, A. Heise, F. Naumann                                                                  (TKDE)
2013   Ein Datenbankkurs mit 6000 Teilnehmern
       F. Naumann, M. Jenders, T. Papenbrock                                                                (Informatik-Spektrum)
       Duplicate Detection on GPUs
       B. Forchhammer, T. Papenbrock, T. Stening, S. Viehmeier, U. Draisbach, F. Naumann                    (BTW)
2011   BlackSwan: Augmenting Statistics with Event Data
       J. Lorey, F. Naumann, B. Forchhammer, A. Mascher, P. Retzlaff, A. Zamani Farahani, S. Discher,
       C. Fähnrich, S. Lemme, T. Papenbrock, R. C. Peschel, S. Richter, T. Stening, S. Viehmeier            (CIKM)

Discovery      Application

Distribution   Robustness
