EURISCO update 2023: the European Search Catalogue for Plant Genetic Resources, a pillar for documentation of genebank material

Page created by Alvin Reese
 
CONTINUE READING
EURISCO update 2023: the European Search Catalogue for Plant Genetic Resources, a pillar for documentation of genebank material
Published online 3 October 2022                                   Nucleic Acids Research, 2023, Vol. 51, Database issue D1465–D1469
                                                                                                     https://doi.org/10.1093/nar/gkac852

EURISCO update 2023: the European Search
Catalogue for Plant Genetic Resources, a pillar for
documentation of genebank material
Pragna Kotni1 , Theo van Hintum                          2
                                                             , Lorenzo Maggioni                3
                                                                                                   , Markus Oppermann                  1
                                                                                                                                           and
Stephan Weise 1,*
1
 Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Corrensstr. 3, 06466 Seeland,
Germany, 2 Centre for Genetic Resources, The Netherlands (CGN), Wageningen University & Research,

                                                                                                                                                              Downloaded from https://academic.oup.com/nar/article/51/D1/D1465/6746864 by guest on 07 January 2023
Droevendaalsesteeg 1, 6708 PB Wageningen, The Netherlands and 3 European Cooperative Programme for Plant
Genetic Resources (ECPGR), c/o Alliance of Bioversity International and CIAT, Via di San Domenico 1, 00153
Rome, Italy

Received July 29, 2022; Revised September 13, 2022; Editorial Decision September 16, 2022; Accepted September 23, 2022

ABSTRACT                                                                        vated plants and their wild relatives in the future, this di-
                                                                                versity must be preserved. Genebanks play an important
The European Search Catalogue for Plant Genetic                                 role in long-term preservation efforts. There are about 1800
Resources (EURISCO) is a central entry point for in-                            genebank collections of plant genetic resources for food and
formation on crop plant germplasm accessions from                               agriculture (PGRFA) worldwide, of which about 625 are in
institutions in Europe and beyond. In total, it provides                        Europe (9). It is a truism that something can only be used
data on more than two million accessions, making an                             if one has information about it. The best resource will not
important contribution to unlocking the vast genetic                            be exploited if it is not well documented, and without data,
diversity that lies deposited in >400 germplasm col-                            proper genebank management will not be possible (10). In
lections in 43 countries. EURISCO serves as the ref-                            other words, the better described PGRFA material is, the
erence system for the Plant Genetic Resources Strat-                            more valuable it is for potential users and the better it can
egy for Europe and represents a significant approach                            be preserved. In addition to pure management data and in-
                                                                                formation on the legal status of the material, it is therefore
for documenting and making available the world’s
                                                                                important to have information that allows users to select
agrobiological diversity. EURISCO is well established                           the most suitable material for breeding and research pro-
as a resource in this field and forms the basis for a                           grammes, especially passport data (about identity and ori-
wide range of research projects. In this paper, we                              gin) and phenotypic characterisations (about traits) of the
present current developments of EURISCO, which is                               genebank material (11).
accessible at http://eurisco.ecpgr.org.                                            The European Search Catalogue for Plant Genetic
                                                                                Resources (EURISCO) is an international aggregated
INTRODUCTION                                                                    database that aims to provide a central entry point for infor-
                                                                                mation on the large genetic diversity harboured in the col-
Crop plants are the basis of nutrition for humans and
                                                                                laborating collections. Presently, it contains data on more
farmed animals. However, biodiversity, including the agri-
                                                                                than two million genebank samples, so-called accessions,
cultural component, is threatened worldwide by various fac-
                                                                                which are preserved in >400 institutes in Europe and some
tors (1–4). The progressing climate crisis plays a special role
                                                                                neighbouring countries. EURISCO is maintained on behalf
here (5,6). In particular, rising temperatures, changing pre-
                                                                                of the European Cooperative Programme for Plant Genetic
cipitation patterns and the increasing frequency of extreme
                                                                                Resources (ECPGR) and is based on a network of National
weather events are leading to adverse effects, such as de-
                                                                                Inventory Focal Points, one in each of 43 member coun-
clining crop yields (7). The changes in the environment due
                                                                                tries. EURISCO has been available online since 2003 and
to the climate crisis are also forcing farmers to grow other
                                                                                from 2014 the Leibniz Institute of Plant Genetics and Crop
varieties or even other crops, with the danger of losing the
                                                                                Plant Research (IPK) Gatersleben, Germany, has been re-
old ones. Furthermore, crop related wild species are increas-
                                                                                sponsible for the operation and further development of the
ingly under threat of disappearance due to these environ-
                                                                                information system as well as the coordination of the EU-
mental changes (8). In order to combat this genetic ero-
                                                                                RISCO network of National Inventory Focal Points.
sion and be able to rely on the genetic diversity of culti-

* To   whom correspondence should be addressed. Tel: +49 39482 5 744; Fax: +49 39482 5 155; Email: weise@ipk-gatersleben.de


C The Author(s) 2022. Published by Oxford University Press on behalf of Nucleic Acids Research.
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which
permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
EURISCO update 2023: the European Search Catalogue for Plant Genetic Resources, a pillar for documentation of genebank material
D1466 Nucleic Acids Research, 2023, Vol. 51, Database issue

SYSTEM OVERVIEW                                                      Since the first description of EURISCO in the NAR
                                                                  database issue 2017 (12), the volume of data has been ex-
The National Inventory Focal Points of the EURISCO
                                                                  panding continuously. The number of data provider institu-
member countries are responsible for the development and
                                                                  tions rose from 376 to 402. The number of accessions docu-
maintenance of the National Inventories. These comprise
                                                                  mented in the system increased by >200 000 (13%), details
of data about the PGRFA collections maintained by local
                                                                  of which are illustrated in Table 1. During the same period,
curators in these countries under ex situ conditions. The
                                                                  the number of phenotypic data points increased by 2.2 mil-
data management of these collections is in many cases sup-
                                                                  lion (500%). This shows that EURISCO is steadily gaining
ported by IT systems that allow data exchange with the Na-
                                                                  acceptance as a repository of such data. Moreover, data on
tional Inventory Focal Points, which in turn regularly up-
                                                                  plant genetic resources are not static. Rather, they are con-
date the data in EURISCO (12). The basis for the core of the
                                                                  tinuously curated and expanded. On average, between 30
data collection (passport data) is the Multi-Crop Passport
                                                                  and 40 National Inventory datasets are updated each year,
Descriptors standard (MCPD) (13). MCPD is the globally
                                                                  either in part or in full. These updates include data of on

                                                                                                                                    Downloaded from https://academic.oup.com/nar/article/51/D1/D1465/6746864 by guest on 07 January 2023
recognised standard for the documentation and exchange
                                                                  average 350 000–400 000 accessions per year. Regular train-
of passport data on plant genetic resources and enables the
                                                                  ing workshops, organised with the support of ECPGR, are
acquisition of data following a well-defined and legally legit-
                                                                  held to support the data providers, which aim, among other
imised procedure. Additionally, a EURISCO-specific for-
                                                                  things, to raise awareness of various aspects of data quality
mat is used for the exchange of phenotypic data. The pro-
                                                                  and to successively improve it.
cedures for exchanging data with the National Inventory
                                                                     Stable and unique identifiers are an important prerequi-
Focal Points are well established, but are constantly being
                                                                  site for the operation of aggregated databases such as EU-
adapted and improved.
                                                                  RISCO (10), allowing the information in EURISCO to be
   EURISCO runs on an Oracle relational database man-
                                                                  linked easily to information in other data sources, espe-
agement system using its specific technologies like PL/SQL
                                                                  cially where it concerns genomics and other omics data,
and Oracle Application Express. The database backend in-
                                                                  that are not included in EURISCO. Digital Object Identi-
cludes a staging area for pre-processing and data cleans-
                                                                  fiers (DOIs) have established themselves as a quasi-standard
ing (64 tables) as well as performance-optimised database
                                                                  in the field of plant genetic resources in recent years (14).
structures for the web frontend (48 tables). The >200
                                                                  Therefore, the increasing acceptance of DOIs is a very posi-
functions and procedures within 23 packages mainly pro-
                                                                  tive development. In close cooperation with the Secretariat
vide functionalities for data quality assurance, but also for
                                                                  of the ITPGRFA, EURISCO offers its data providers the
user-specific download options, visualisations and statistics,
                                                                  service of carrying out the DOI registration of their acces-
among others. A web frontend with a broad range of func-
                                                                  sions on their behalf. This service is well established, as re-
tions is available for the users of the system. The frontend is
                                                                  flected in the continuous increase in the number of DOIs.
meant to be user-friendly for any type of the targeted users,
which include plant breeders and researchers, but also farm-
ers, policy makers and other scholars. The web frontend is        ENHANCED WEB INTERFACE
developed with Oracle Application Express technology, ver-
sion 21.                                                          EURISCO’s central user interface has recently been com-
                                                                  pletely revised. It offers a variety of ways to find informa-
                                                                  tion (Figure 1). In addition to various wizard-based stan-
                                                                  dard searches, indexes of common crop names and taxo-
UPDATED DATABASE CONTENT AND STATISTICS
                                                                  nomic terms are offered to provide searches at different lev-
Currently, more than two million PGRFA accessions are             els. Map-based searches are also possible. The results can
documented in EURISCO. These include major cereal                 then be narrowed down using further filters, e.g. faceted
grains such as wheat (205 088), barley (126 330) and maize        searches (Figure 2). Various data export features allow data
(67 869), pulses such as beans (55 344), peas (38 914) and        to be downloaded for later use. In addition to passport data
vetches (29 438), edible oilseed crops such as sunflower          searches, wizards offer the possibility to search for pheno-
(7735) and linseed (20 900), tuber crops such as potatoes         typic data.
(15 139), accessions of the Brassica complex (35 308), but           One feature that should be highlighted is the sophisti-
also a very large research collection of Arabidopsis thaliana,    cated taxonomic search in EURISCO. When searching by
a model plant in life sciences (684 967) is included. In          taxonomic names, users often face the challenge that differ-
total, the accessions documented in EURISCO comprise              ent PGRFA collections follow different taxonomic schools
6737 genera and 45 175 species. 428 160 of the accessions         and traditions, resulting in a variety of accepted names and
are part of the Multilateral System of Access and Ben-            synonyms. Therefore, a feature was implemented for EU-
efit Sharing (MLS) of the International Treaty on Plant           RISCO that automatically maps the scientific names of all
Genetic Resources for Food and Agriculture (ITPGRFA),             accessions against different taxonomic repositories repre-
which means that they have been confirmed by the respec-          senting different taxonomic opinions. This leads to much
tive countries to be promptly available under standard and        more complete result sets. Currently, the mapping is done
internationally agreed benefit sharing conditions. In addi-       against GRIN Taxonomy (15) as well as the Mansfeld’s
tion, there are 2.6 million records of phenotypic data from       World Database of Agricultural and Horticultural Crops,
more than 3800 experiments conducted on genebank mate-            which is based on the Mansfeld’s Encyclopedia of Agricul-
rial.                                                             tural and Horticultural Crops (16). The taxonomic search
Nucleic Acids Research, 2023, Vol. 51, Database issue D1467

Table 1. PGRFA accessions of the EURISCO database grouped by genus and species. The ten most common genera are broken down in detail. The total
number of accessions per genus is shown in the column ‘Accs. total’. In addition, the figures from the first description of EURISCO in the NAR database
issue 2017 (12) and the percentage change since then are shown
Genus                                    Species                 No. of accs.             Accs. total            Weise et al. (2017)         Increase
Arabidopsis                              thaliana                  684 967                 685 192                    669 587                 2.33%
                                         others                      225
Triticum                                 aestivum                  145 483                 205 088                    147 055                 39.46%
(wheat)                                  durum                     17 482
                                         turgidum                  15 084
                                         monococcum                 3624
                                         spelta                     3419
                                         dicoccum                   1187
                                         aethiopicum                1048
                                         others                    17 761

                                                                                                                                                          Downloaded from https://academic.oup.com/nar/article/51/D1/D1465/6746864 by guest on 07 January 2023
Hordeum                                  vulgare                   116 283                 126 330                    105 289                 19.98%
(barley)                                 spontaneum                 6172
                                         others                     3875
Zea                                      mays                      67 743                   67 869                     61 932                 9.59%
(maize)                                  others                      126
Phaseolus                                vulgaris                  50 211                   55 344                     49 774                 11.19%
(garden bean)                            coccineus                  2329
                                         others                     2804
Solanum                                  lycopersicum              22 429                   53 295                     44 400                 20.03%
(tomato, potato, eggplant, etc.)         tuberosum                 15 139
                                         andigenum                  2814
                                         melongena                  2219
                                         others                    10 694
Vitis                                    vinifera                  38 295                   47 488                     28 819                 64.78%
(grape)                                  others                     9193
Avena                                    sativa                    33 838                   42 405                     29 429                 44.09%
(oat)                                    sterilis                   2401
                                         byzantina                  1988
                                         others                     4178
Pisum                                    sativum                   37 424                   38 914                     29 735                 30.87%
(pea)                                    others                     1490
Malus                                    domestica                 32 553                   34 558                     27 698                 24.77%
(apple)                                  others                     2005
others                                                                                     723 029                    649 034                 11.40%
Total                                                                                     2 079 512                  1 842 752                12.85%

feature was first described in (17) and has been continuously                   context, it should not go unmentioned that an extension
expanded since then.                                                            of EURISCO to include data on in situ crop wild relatives’
   Another feature is the uniform search for common crop                        populations is currently under preparation and will further
names. Thus far, there is no globally accepted controlled vo-                   strengthen the role of the system.
cabulary for this. As a result, the PGRFA accessions in EU-                        In addition, work is ongoing to expand EURISCO’s role
RISCO are described with arbitrary crop names or no crop                        as a repository of phenotypic data on PGRFA accessions.
name at all, which makes a uniform search for accessions of                     The amount of corresponding data has increased signifi-
a specific crop across all accessions impossible. For this rea-                 cantly in recent years. Compliance with the criteria defined
son, a mechanism was developed that automatically maps                          by FAIR (18) is an important goal for today’s research data
all accessions in EURISCO against the crop names used in                        management. However, the phenotypic data managed in
GRIN (15), regardless of what data, if any, were provided                       EURISCO cannot fully comply with these criteria. In the
by the National Inventories for ‘crop name’. This makes it                      genebank community, a major challenge is that there is a
possible to search uniformly across all two million acces-                      large variety of small, relatively poorly described pheno-
sions and to achieve significantly better matching results.                     typing datasets. Some of these are several decades old, but
                                                                                represent important sources for the use of PGRFA mate-
                                                                                rial and are indispensable against this background. Due to
SUMMARY AND PERSPECTIVES                                                        the history of the data, traits and methods used are often
EURISCO’s main task is to provide a central entry point                         heterogeneous. Thus, an important goal is to achieve full
for information on plant genetic resources maintained ex                        compliance with MIAPPE (Minimum Information about
situ in participating institutions. This presents an impor-                     Plant Phenotyping Experiments (19)) in order to improve
tant contribution to opening up a large part of the millions                    the FAIRness of the data. To further improve access to the
of accessions preserved in germplasm collections worldwide                      data available in EURISCO, the implementation of Breed-
for research and breeding. In particular, old landraces and                     ing API (BrAPI) (20) endpoints is being evaluated. More-
crop wild relatives documented in EURISCO can provide                           over, the various research projects EURISCO is involved in
important contributions to future breeding efforts. In this                     are used to develop and test further extensions, such as an
D1468 Nucleic Acids Research, 2023, Vol. 51, Database issue

                                                                                                                                                          Downloaded from https://academic.oup.com/nar/article/51/D1/D1465/6746864 by guest on 07 January 2023
Figure 1. The different search capabilities of EURISCO.

Figure 2. Overview of the EURISCO web interface. (A) The result set of a search query, which can be further narrowed down using facets. (B) A map-based
search. (C) A detailed view of passport and phenotypic data of a selected accession.
Nucleic Acids Research, 2023, Vol. 51, Database issue D1469

improved linkage with genetic data. The web interface will                         density substantially affect ecosystem function despite invariant
continue to be developed in the coming years.                                      richness. Ecol. Lett., 20, 1315–1324.
                                                                              5.   Habibullah,M.S., Din,B.H., Tan,S.H. and Zahid,H. (2022) Impact of
   EURISCO is also the European node of the interna-                               climate change on biodiversity loss: global evidence. Environ. Sci.
tional PGRFA information system Genesys, operated by                               Pollut. Res., 29, 1073–1086.
the Global Crop Diversity Trust, and provides the passport                    6.   Scheffers,B.R., Meester,L.D., Bridge,T.C.L., Hoffmann,A.A.,
data of the accessions maintained ex situ. The phenotypic                          Pandolfi,J.M., Corlett,R.T., Butchart,S.H.M., Pearce-Kelly,P.,
data mentioned above are provided exclusively through EU-                          Kovacs,K.M., Dudgeon,D. et al. (2016) The broad footprint of
                                                                                   climate change from genes to biomes to people. Science, 354, aaf7671.
RISCO. The same applies to the in situ crop wild relative                     7.   Qian,B., Jing,Q., Bélanger,G., Shang,J., Huffman,T., Liu,J. and
data that will soon be included in EURISCO. Together with                          Hoogenboom,G. (2018) Simulated canola yield responses to climate
systems like Genesys or GRIN-Global, EURISCO forms a                               change and adaptation in Canada. Agron. J., 110, 133–146.
vital component of the global information system (GLIS)                       8.   Khoury,C.K., Brush,S., Costich,D.E., Curry,H.A., de Haan,S.,
                                                                                   Engels,J.M.M., Guarino,L., Hoban,S., Mercer,K.L., Miller,A.J. et al.
(21). It is expected that EURISCO will remain an impor-                            (2022) Crop genetic erosion: understanding and responding to loss of
tant resource in the field of plant genetic resources manage-                      crop diversity. New Phytologist, 233, 84–118.

                                                                                                                                                           Downloaded from https://academic.oup.com/nar/article/51/D1/D1465/6746864 by guest on 07 January 2023
ment and use, and will continue to form the basis for a wide                  9.   Engels,J.M. and Maggioni,L. (2012) AEGIS: A regionally based
range of research projects. Its further strengthening and de-                      approach to PGR conservation. In: Maxted,N., Dulloo,M.,
velopment has been included among the 2030 targets of the                          Ford-Lloyd,B., Frese,L., Iriondo,J. and Pinheiro de Carvalho,M.
                                                                                   (eds). Agrobiodiversity Conservation: Securing the Diversity of Crop
Plant Genetic Resources Strategy for Europe (22).                                  Wild Relatives and Landraces. CABI, pp. 321–326.
                                                                             10.   Weise,S., Lohwasser,U. and Oppermann,M. (2020) Document or lose
                                                                                   it – on the importance of information management for genetic
DATA AVAILABILITY                                                                  resources conservation in genebanks. Plants, 9, 1050.
                                                                             11.   Hoisington,D., Khairallah,M., Reeves,T., Ribaut,J.M., Skovmand,B.,
EURISCO will be continuously maintained and updated,                               Taba,S. and Warburton,M. (1999) Plant genetic resources: what can
and is publicly accessible at http://eurisco.ecpgr.org. After                      they contribute toward increased crop productivity?. Proc. Natl.
its launch in 2003 as a product of an EU-funded project,                           Acad. Sci. U.S.A., 96, 5937–5943.
continuity of funding of EURISCO has been guaranteed in                      12.   Weise,S., Oppermann,M., Maggioni,L., van Hintum,T. and
the past twenty years with contributions mainly from the                           Knüpffer,H. (2017) EURISCO: the European search catalogue for
                                                                                   plant genetic resources. Nucleic Acids Res., 45, D1003–D1008.
European countries that are members of ECPGR.                                13.   Alercia,A., Diulgheroff,S. and Mackay,M. (2015) In: FAO/Bioversity
                                                                                   Multi-Crop Passport Descriptors V.2.1 (MCPD V.2.1). Food and
                                                                                   Agriculture Organization of the United Nations (FAO). Rome, Italy;
ACKNOWLEDGEMENTS                                                                   Bioversity International, Rome, Italy.
                                                                             14.   Alercia,A., López,F., Sackville Hamilton,N. and Marsella,M. (2018)
The authors wish to thank the network of the EURISCO                               In: Digital Object Identifiers for Food Crops - Descriptors and
National Inventory Focal Points as well as all institutions                        Guidelines of the Global Information System. Food and Agriculture
providing data for their hard work and great commitment.                           Organization of the United Nations (FAO), Rome, Italy.
                                                                             15.   Wiersema,J.H. (1995) Taxonomic information on cultivated plants in
                                                                                   the USDA/ARS Germplasm Resources Information Network
FUNDING                                                                            (GRIN). Acta Horticulturae, 413, 109–116.
                                                                             16.   Hanelt,P. (2001) Mansfeld’s Encyclopedia of Agricultural and
Development and maintenance of EURISCO: Euro-                                      Horticultural Crops. Springer, Berlin, Heidelberg.
pean Cooperative Programme for Plant Genetic Re-                             17.   Kreide,S., Oppermann,M. and Weise,S. (2019) Advancement of
sources (ECPGR), Rome, Italy. Funding for open ac-                                 taxonomic searches in the European search catalogue for plant
                                                                                   genetic resources. Plant Genet. Resources: Character. Util, 17,
cess charge: Leibniz Institute of Plant Genetics and Crop                          559–561.
Plant Research (IPK) Gatersleben, Germany; Deutsche                          18.   Wilkinson,M.D., Dumontier,M., Aalbersberg,I.J., Appleton,G.,
Forschungsgemeinschaft (DFG, German Research Foun-                                 Axton,M., Baak,A., Blomberg,N., Boiten,J.W., da Silva Santos,L.B.,
dation) [491250510, in part].                                                      Bourne,P.E. et al. (2016) The FAIR guiding principles for scientific
                                                                                   data management and stewardship. Sci. Data, 3, 160018.
Conflict of interest statement. None declared.                               19.   Papoutsoglou,E.A., Faria,D., Arend,D., Arnaud,E.,
                                                                                   Athanasiadis,I.N., Chaves,I., Coppens,F., Cornut,G., Costa,B.V.,
                                                                                   Ćwiek Kupczyńska,H. et al. (2020) Enabling reusability of plant
REFERENCES                                                                         phenomic datasets with MIAPPE 1.1. New Phytologist, 227, 260–273.
1. Eastwood,N., Stubbings,W.A., Abdallah,M.A.A.E., Durance,I.,               20.   Selby,P., Abbeloos,R., Backlund,J.E., Basterrechea Salido,M.,
   Paavola,J., Dallimer,M., Pantel,J.H., Johnson,S., Zhou,J.,                      Bauchet,G., Benites-Alfaro,O.E., Birkett,C., Calaminos,V.C.,
   Hosking,J.S. et al. (2022) The Time Machine framework: monitoring               Carceller,P., Cornut,G. et al. (2019) BrAPI – an application
   and prediction of biodiversity loss. Trends Ecol. Evol., 37, 138–146.           programming interface for plant breeding applications.
2. Backhaus,T., Snape,J. and Lazorchak,J. (2012) The impact of                     Bioinformatics, 35, 4147–4155.
   chemical pollution on biodiversity and ecosystem services: the need       21.   Shaw,P.D., Weise,S., Obreza,M., Raubach,S., McCouch,S., Kilian,B.
   for an improved understanding. Integr. Environ. Assess. Manag., 8,              and Werner,P. (2023) Database solutions for genebanks and
   575–576.                                                                        germplasm collections. In: Ghamkhar,K., Williams,W. and
3. Cardinale,B.J., Duffy,J.E., Gonzalez,A., Hooper,D.U., Perrings,C.,              Brown,A.H.D. (eds). Plant Genetic Resources for the 21st Century –
   Venail,P., Narwani,A., Mace,G.M., Tilman,D., Wardle,D.A. et al.                 The OMICS Era. chapter 16. Apple Academic Press, Palm Bay, FL,
   (2012) Biodiversity loss and its impact on humanity. Nature, 486,               USA.
   59–67.                                                                    22.   ECPGR (2021) In: Plant Genetic Resources Strategy for Europe.
4. Spaak,J.W., Baert,J.M., Baird,D.J., Eisenhauer,N., Maltby,L.,                   European Cooperative Programme for Plant Genetic Resources,
   Pomati,F., Radchuk,V., Rohr,J.R., Van den Brink,P.J. and De                     Rome, Italy.
   Laender,F. (2017) Shifts of community composition and population
You can also read