Decolonizing the Internet's Structured Data - SUMMARY REPORT - Whose ...

Page created by Rodney Fischer
 
Decolonizing the Internet's Structured Data

SUMMARY REPORT
In October 2021, over 40 participants from around the world were brought together for a conversation about decolonizing the internet's structured data, leading up to WikidataCon. The event was jointly organized by Whose Knowledge?, Wiki Movimento Brasil, and Wikimedia Deutschland. This is the report back from our gathering and the reflections that emerged from it.

Image by Whose Knowledge?, CC BY-SA 4.0, via Wikimedia Commons
Content

Why did we organize this conversation?
What did we do, and how did we do it?
What emerged
     Perspectives and provocations
     Imaginations and Implementations
     Listening and learning
What is next?
Gratitude
Why did we organize this conversation?

Structured data is at the core of how the internet - as we currently know it - works. These are pieces of information organized in such a way that they can be easily read, understood, and processed by machines. Countless apps and platforms are built upon such structured data systems, including tools created by content providers like Google and collaborative initiatives like Wikidata. As a result, they permeate every corner of our daily lives online, from maps and navigation systems to movie reviews and rankings.

Through these systems, massive amounts of data get sorted out, organized, and classified in relation to other pieces of data. As a result, they are informed and structured by and around specific regulations, traditions, and epistemologies. In an attempt to categorize the world, they prescribe certain frames and worldviews - in other words, they are far from the often-assumed neutrality of data. By analyzing the power dynamics of structured data, we can examine whose views, whose agenda, whose ontologies (categories of classification), and whose decisions build and sustain these classifications and systems. We can then work together to build more just and equitable systems of structured data.

This transformation of structured data systems - and knowledge itself - requires an ongoing conversation and commitment that is not limited to technology companies or specific sectors. It needs to be a collective effort from individuals, organizations, and communities with different forms of knowledge and expertise, a process we call allyship or “solidarity in action”. Most importantly though, this shared process needs to center the expertise, lived experiences, and leadership of the minoritized majority of the world.

As we’ve shared before, only a fraction of online public knowledge is produced on or by women, people of color, LGBTQI folks, Indigenous communities, and peoples from the Global South, in languages that are not English or other colonial languages. Yet we are the majority of the world’s population, the majority of those online, and the ones most impacted by how structured data is used, or abused. We are the communities that are most impacted by historical and ongoing structures of power and privilege and interlocking systems of oppression, whether it is colonization, patriarchy, racism, homophobia, classism, casteism, ableism, and beyond. This is why we need to start decolonizing the internet as a practice, not simply a metaphor: we need to transform the different forms of power and privilege that shape our digital technologies.

Learn more about why knowledge justice matters for structured data from this presentation by our co-director Anasuya Sengupta.

"What a powerful and heartfelt presentation on knowledge decolonization and its importance to structured data from @Anasuyashh @WhoseKnowledge at #WikidataCon2021. Thank u. We are not here to fill the gaps, but to restruct and rethink the ways of doing things."
FLAVIA DORIA
@AFLAVIADORIA ON TWITTER, 29 OCTOBER 2021
Decolonizing the Internet’s Structured Data was born out of the pressing, urgent need for such a collective conversation, centering those who are often marginalized. On October 13th, 2021, we invited over 40 knowledge activists, community organizers, tech-builders, and other “unusual allies” to join a safe, multilingual, collaborative space. This group of thoughtful, powerful thinkers and doers was mostly female-identifying (71%), in/from the Global South (66%), and indigenous/black/people of color(s) (82%) in origin. The conversation served as a pre-conference for WikidataCon 2021, and was jointly organized by Whose Knowledge?, Wikimedia Deutschland, and Wiki Movimento Brasil. As a result, we also prioritized the participation of communities from Latin America and the Caribbean.

What did we do, and how did we do it?

Decolonizing the Internet’s Structured Data was organized in three different panels. In Panel 1, our guest speakers offered Perspectives and Provocations to frame the overall conversations. Then, participants split into seven groups for discussions named Imaginations and Implementations. Finally, listeners brought back key insights in a session of Listening and Learning, in which we shared reflections and ideas for the next steps.

We started the conversation by clearly laying out three core beliefs and commitments: love, respect, and solidarity. Based on this set of guiding principles, we wanted participants to be aware of their positionalities and privileges, and to be able to be their full, multiple selves during the whole session. Our goal was to promote interactions centered around listening and learning, and around respect (of each other and of our time together). By engaging in meaningful conversations, we wanted to lay the groundwork for key areas and future concrete actions.

We carried out the conversation virtually, and offered simultaneous interpretation in English, Portuguese and Spanish. We wanted everyone to be able to fully express themselves in a language that they felt comfortable with and to share their experiences and knowledges with each other regardless of language barriers.

"It has really been an extremely enlightening conversation, and listening to the voices from different academic disciplines, the dialogue of knowledges - which always enriches so much - and not only that but also from so many countries, and with so many languages. A little Babel."
MONICA CECILIA CALDERON
CLOSING SESSION, 13 OCTOBER 2021
Additionally, we established clear privacy practices in order to guarantee people’s safety and wellbeing during the entire call. For instance, participants could keep their cameras off if they preferred, and they needed to seek clear consent to publish and attribute any identifiable quote from another participant before sharing it on social media during and after the event.

Here is how the conversations unfolded.

What emerged

Perspectives and provocations

The conversation began with a panel named Perspectives and Provocations in which we framed the urgent issues around decolonizing structured data, and panelists explained what it meant to them. This first part drew on the insights of Lydia Pintscher, who is a product manager at Wikidata; Matariki Williams, a Māori writer, curator and historian, who is currently at the Ministry of Culture and Heritage in New Zealand; Paz Peña, an activist and feminist techie based in Chile; and Stacy Allison-Cassin, an academic at the University of Toronto, librarian, Wikidata enthusiast, and a citizen of the Métis Nation from Ontario, Canada. This initial panel grounded us in what is at stake when we talk about structured data.

We began by situating the questions and frames we would cover during the event: what structured data is, and what it means to build systems around it that center multiple epistemologies. In order to advance in those discussions, we were rooted in power analysis: questioning how the control over resources - from the design of platforms to the access to content in one’s languages - plays out at different levels online. Questions such as “whose story?,” “who is the storyteller and curator?,” and “whose systems of categorization?” permeated many interventions. As the panel progressed, we engaged with the importance of structured data and laid the ground for imagining other possibilities.

In this first part of the event, panelists challenged the alleged neutrality of structured data. We framed structured data as pieces of ideology, not as neutral categories that classify the world in a certain way. Participants navigated the nuances that emerge when we question how data is classified and based on which epistemologies, while also centering the needs and the right of refusal and determination of our communities. How do we build alternatives based on emancipation and liberation? As beautifully put by our panelists, this conversation needs to go beyond a nice-to-have, and be recognized as an urgent matter that requires resources in order to be changed and improved.

Here’s a sneak peek of this initial discussion. Answers have been edited and condensed for the sake of clarity and length.
What does structured data mean, and why is it important that we talk about it?

We have all of these different systems that interact, that structure our movement through a space in particular ways, and sometimes that way of thinking really funnels us [...]. Each of those times when we encounter one of those systems of structuring it, there is a little piece of ideology that conceptualizes the world in a particular way. It is an interaction between those pieces of information or data and the information system that surrounds them. [...] So, structured data for me begins at that very fundamental place of how we structure the world around us, how we interact with different information systems. Then we come along to a system like Wikidata, which I think is really wonderful in a lot of ways, like its ability to be a collaboration between communities, and to have some agency in changing some of those knowledge structures.

- STACY ALLISON-CASSIN

What does it mean to have multiple knowledge frames, multiple epistemic frames, or epistemologies at the heart of structured data?

One of the things I've just been thinking about is the way in which power plays a role in structured data. And as someone who is a user rather than a designer of structured data, especially as an indigenous person in a colonized country, the way that plays out in the areas in which I've worked is quite huge. [...] We do not fit within the system, as the system has not been designed by us. That has many implications in the way in which knowledge itself has been shared, or collected, and gathered, and added to over time. [...] When we talk about internal processes, from a Ministry perspective, it is about making and structuring data in a way that makes our internal work more efficient. But what does that mean for our partners? Are we then expecting them to fit within a template that enables our work, and are we no longer centering their aspirations on how they handle their own knowledge? [...] So, when it comes to thinking about approaching structured data from and with multiple epistemologies, it happens already, and it happens naturally as an indigenous person.

- MATARIKI WILLIAMS
How can we re-imagine structured data, especially from a feminist and anti-colonial lens?

The structuring of the data in these systems is a historical continuation of this white, Eurocentric, colonial, patriarchal fantasy, which assumes that the data floats. They float free from their origins, they are stripped from our bodies, from our history, and therefore they are free to navigate the world like a neutral currency, a universal currency. [...] I also believe that structured data often act as a hegemonic force, because they literally drag ways of seeing the world, as hegemonic forces do. This hegemonic force presents itself as inevitable, it seeks to be the only way to order the world. [...] If we want to see the data structured from a feminist perspective, we have to fight for the democratization of the discussion.

- PAZ PEÑA

What is one thing you would like to see done differently in structured data today, that will help us come to that space of emancipation and liberation?

In Wikidata, we are trying to provide structured data in a way that is more just, more equitable, more representative of the different experiences and the world that we live in. [...] We're trying to strike a balance between providing useful and usable data that people can build applications upon, and at the same time, doing justice to people, and helping them feel represented, and seen in that data, and having it be a representation of their world. And that makes the data more complex in Wikidata sometimes. We regularly get told that our data is hard to use because of that. And if I could change one thing, it would be finding ways to bridge that gap.

- LYDIA PINTSCHER

WATCH THE RECORDING OF THIS PANEL
Imaginations and Implementations

During this part of the event, participants split up into seven small groups that combined members of different backgrounds and areas of expertise. Each group had a facilitator, as well as someone in charge of listening and reporting back to the larger plenary. This time, we posed questions aimed at the future: what would structured data with multiple epistemic frames and anchors look like? We asked folks to imagine an internet in 2040 that has structured data with multiple epistemic frames and anchors at its core.

Groups detailed different elements related to the topic: the need to recognize that lived experiences go far beyond online (structured) data; the urgency to create, promote and expand initiatives around literacy and education about data and its impacts on people; and the possibilities brought up by centering marginalized communities in the design, ownership, and leadership of online platforms.

See snapshots from these conversations:

"Happy to participate in the Breakout Rooms of @WhoseKnowledge Decolonizing the Internet's Structured Data. The most important outcome here is to think that seeking fairness in #KnowledgeGraphs should not only be restricted to facts but also to data models and logical constraints."
HOUCEMEDDINE TURKI
@CSISC1994 ON TWITTER, 13 OCTOBER 2021
Listening and Learning

This session, called Listening and Learning, followed our smaller group discussions. We invited listeners from each group to report back to the plenary and, once again, we had an amazing group of members: Thomas Hervé Mboa Nkoudou, a Cameroonian social scientist, biochemist and maker; Maya Indira Ganesh, an Indian researcher and educator on tech, especially ethical AI; Jani Pereira, a biologist based in São Paulo, Brazil; Constanza Verón, a historian and educator based in Buenos Aires, who is currently at Wikimedia Argentina; Asaf Bartov, a Wikimedian, structured data enthusiast, and digital librarian; Amanda Jurno, a Brazilian professor and researcher in communication studies, from Wiki Movimento Brasil; and Alex Hanna, a sociologist and Director of Research at the Distributed AI Research Institute.

They explained what had surprised, excited, or challenged them and what had been the most radically imaginative and interesting ideas, as well as suggestions for how to get there. Our listeners and other participants offered these key insights:

ACCESS AND CONTROL OF DATA

▶ Recognize that decolonizing structured data and the systems built upon them is a critical and urgent human rights issue, not simply a nice-to-have conversation on the margins.
▶ Acknowledge that marginalized communities - the minoritized majority of the world - have little to no access or control over the data that governs their lives currently.
▶ Share knowledge with different communities about data, technical tools, and how to inhabit and use online spaces and digital tools safely and fully.
▶ Be aware that a community-centered vision for structured data may be irreconcilable with the agendas of some individuals and organizations invested in structured data, especially those using it for profit over human well-being.

AGENCY/ENGAGEMENT

▶ Deny people the excuses not to engage with the decolonization of structured data, by creating and sharing resources and gradually creating an ecosystem where there are no reasonable excuses not to acknowledge and represent the diversity of contexts and epistemologies.
▶ Acknowledge that Indigenous and other marginalized communities should have the right to refuse to have their knowledges online, or to be datafied, and the agency over what information they share with the world.
DISTRIBUTED DATA

▶ Develop ways of creating smaller and connected datasets that are governed by marginalized communities themselves, rather than by companies driven only by profit.

PLANET-CENTERED REDESIGN

▶ Be mindful of and incorporate in discussions the environmental impacts of the infrastructure needed for the processing of massive amounts of data (e.g., data centers).
▶ Restructure systems while constantly asking ourselves: what are they for? What purpose do they serve?

PLURALITY OF DATA

▶ Redesign systems and model them based on different knowledges and communities’ glorious complexities.
▶ Recognize that most of the world’s knowledge systems are not text-based (and might include a medium that involves songs, tales, artworks of different kinds), and go beyond text to find different forms of encoding data.
▶ Learn from local examples and their specificities in order to create broader changes, because “local” is rarely “small”, especially in the Global South.
▶ Commit to listening and accommodating different perspectives in changing the nature of structured data, knowing that decolonization is a process that takes time and constant effort from multiple people and communities.

WATCH THE RECORDINGS OF THE LISTENING AND LEARNING PANEL AND THE CLOSING SESSION

What is next?

For many participants, Decolonizing the Internet's Structured Data was a much-needed opportunity to pause for a moment and imagine radical possibilities for structured data. It also meant a chance to do so in community and across regions and areas of study and of work. Moving forward, many of them voiced their interest in organizing more collective spaces like this and advancing more concrete steps towards emancipatory practices.

As organizers, we recognize the enormity of the challenge, and we knew from the start that this conversation would need to be the first of many. As we move into 2022, we are keen to create and convene more opportunities to radically reimagine and redesign the internet’s structured data through a feminist, anti-colonial, anti-racist lens. With these next steps in mind, we are staying connected, through a mailing list, with everyone who joined us for this first conversation. We will also keep our broader communities informed and involved, through updates in our newsletter.
Gratitude

Decolonizing the Internet’s Structured Data would not have been possible without everyone who joined the conversation. Thank you to all our amazing panelists, listeners and participants. Thank you to our fabulous interpreters as well, without whom this would not have been a multilingual conversation. We also want to express our gratitude to our co-conspirators for this event, Amanda Jurno and Érica Azzellini from Wiki Movimento Brasil, and Sabine Müller and Dominik Scholl at Wikimedia Deutschland. Last but not least, we take this opportunity to thank our incredible supporters, partners, and allies at large, whose everyday support made this conversation happen, and will make the many more to come a reality.

This report is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.