Comparative Pattern Analysis of Cretan Folk Songs

 
CONTINUE READING
∗
    Comparative Pattern Analysis of Cretan Folk Songs
                                Darrell Conklin
            Department of Computer Science and Artificial Intelligence
                           Universidad del Paı́s Vasco
                             San Sebastián, Spain
                 IKERBASQUE: Basque Foundation for Science
                                 Bilbao, Spain
                                     conklin@ikerbasque.org

                                 Christina Anagnostopoulou
                                Department of Music Studies
                                    University of Athens
                                       Athens, Greece
                                       chrisa@music.uoa.gr

                                                Abstract
           This paper reports on data mining of Cretan folk songs for distinctive patterns. A
        pattern is distinctive if it occurs with higher probability in a corpus as compared to
        an anticorpus. A small set of Cretan folk songs was collected, organized using a small
        knowledge base of classes, and mined using distinctive pattern discovery methods.
        Several highly distinctive and confident patterns emerge.
    keywords: ethnomusicology, data mining, folk song analysis, pattern discovery, classifi-
cation

1       Introduction

In recent years there is a renewed interest in folk song analysis partly driven by increasing
interests in cultural heritage, and also by advances in music informatics methods. The
    ∗
    This is the author’s version of the work. It is posted here by permission of ACM for your personal
use. Not for redistribution. Please cite the definitive version which was published at MML’ 10, 3rd
International Workshop on Machine Learning and Music, October 25, 2010, Florence, Italy,
pages 33–36.

                                                  1
ability to make predictions, from music content, of song properties such as region, dance
type, tune family, instrumentation, modality, and social function is an important part of
the management of large corpora. Music data mining methods play a key role in building
predictive models for folk song classification.
   The folk music of Crete, like all traditional Greek music, can be divided primarily into
dance and non-dance (which are further subdivided into “Tavla” or “songs of the table”
and “road” songs). There are various types of songs belonging to each of these categories.
For dance songs, the most common types found in Crete are Kontylies and Pentozalis,
Malevisiotis, Syrtos and Sousta. The songs that are not danced are Rizitika, Tambahaniotika,
and other table songs, and then various function type songs, such as lament songs, wedding
songs, and lullabies.
   Although most of these types of songs are found throughout the island, each area has its
own characteristic musical style. The music of the West of Crete is quite distinct from the
music of the East, although there are also many common songs known to the whole island.
Therefore, in stylistic analysis of this music, it is necessary to look not only for patterns that
distinguish between song types, but also how the music of each region differs.
   This paper reports on some initial results obtained with a small collection of folk tunes
from various villages across Crete, exploring the hypothesis that there may exist patterns of
correlation between origin, function and type of song on the one hand, and melodic patterns
on the other.
   As an analysis method we have chosen the MGDP method [4, 5] which discovers patterns
that are distinctive and maximally general in a corpus. This paper will discuss the encoding
of a collection of pieces into a knowledge base for analysis, describe the application of the
MGDP method to the corpus, report on some interesting patterns found, and finally present
some ideas for future work.

                                                  2
2     Method

This section describes the data preparation and data mining methods employed. The data
mining method finds patterns that are over-represented in a positive set (called the corpus)
with respect to a background set of pieces (called the anticorpus).

2.1    Corpus

Three main sources were used for the Cretan folk songs: the first one was the collection of
transcriptions done by Samuel Baud-Bovy in 1953-54 [3], published by the Laographic Music
Archives of Melpo Merlie in Athens. It contains old and unknown songs which were recorded
and transcribed in various villages around the island. The second source was by Peristeris
[8], as found in the Amargiannakis archives at the Academy of Athens and the University of
Athens. The third one was a collection of well-known songs, transcribed for lyra playing [2].
Finally, a few pieces were added from the transcriptions done by Theodossopoulou [9].
    Songs from printed scores were encoded using the Sibelius and Finale software, and
exported to symbolic midi format for computational analysis. Following encoding of the
pieces, a knowledge base was developed to describe the classes of all pieces. Four broad
categories were chosen: song type, song hypertype, area, and hyperarea (see Figure 1).
    Encoded songs were classified into the song types classification specified by Baud-Bovy
[3]. Hyperarea classification was according to a map of Crete, and hypertype according to
the classification of Greek folk music specified by Amargiannakis [1] and Baud-Bovy [3] and
inspection of song lyrics.
    While preparing the corpus and classifying the songs, several interesting points came up
for discussion, and decisions had to be taken:

    • as mentioned above, there are song types found throughout the island, such as the new
      year song (kalanda). These songs were therefore placed in all classes of area (and

                                             3
hyperarea);

   • there are also songs which might have been recorded at a specific village (e.g., a small
      village of Chania), but these songs are sung all through the Western part of the island:
      These songs were thus added to both rethymno and chania areas;

   • for a few songs, although the song type was clear, there was no indication in the sources
      regarding its area. Therefore, in our classification, these pieces are not considered part
      of any class for either area or hyperarea mining;

   • finally, some songs were noted as simply being Tavla songs, without any further song
      type classification, and from the music it was not possible to deduce a more specific
      description. In our opinion this meant that they should be treated as not known for
      the song type category, though they still participated in the song hypertype class
      tavla.

A total of 106 songs was in our final collection. Figure 1 depicts the categories along with
their classes and the number of pieces in each class.

2.2    MGDP discovery

A pattern is a sequence of event features. A rich set of features, including rhythmic ones,
can be ascribed to events [4], but for this study only melodic intervals are used due to the
small corpus size. A piece x instantiates a pattern P , written P (x), if the pattern occurs
(possibly multiple times) in the piece: if the components of the pattern are instantiated by
successive events in the piece.
   A pattern P subsumes (is more general than) a pattern Q if all instances of Q are also
instances of P : if ∀xQ(x) ⇒ P (x) is valid (true in all possible corpora). For example, the pat-
tern [+3] subsumes the pattern [+3, +1], which in turn subsumes, for example, [+2, +3, +1]

                                               4
hyperarea                                      hypertype

                                                                   west (64)          east (47)                  tavla (61)      dance (51)

                                                                                                    song type

   sousta (5)   wedding (14)   city song (7)     pentozalis (11)     kontylies (17)       lament (8)       nanourisma (12)     epic song (1)   malevisiotis (1)   rizitiko (8)   syrtos (17)   kalanda (1)

                                                                                                        area

                                               chania (30)          rethymno (29)                 herakleion (12)             lassithi (35)       syria (4)

Figure 1: The categories and classes used in this study, along with the number of pieces in
each class.

                                                                                                       5
and [+3, +1, −4].
   To rank discovered patterns it is possible to partition a collection of pieces into an explicit
anticorpus (denoted    ), contrasting it with the analysis corpus (denoted ⊕). A distinctive
pattern is one that is frequent and sufficiently over-represented in the corpus as compared
to the anticorpus. An intuitive way to measure this is according to the relative empirical
probability of a pattern in the corpus and anticorpus:

                                                 def    p(P |⊕)
                                       ∆(P ) =                                                (1)
                                                        p(P | )

where

                                           def
                                  p(P |⊕) = c⊕ (P )/n⊕
                                           def
                                  p(P | ) = c (P )/n

where c⊕ (P ) (c (P )) is the total count of a pattern P in the corpus (anticorpus), and n⊕
(n ) is the number of pieces in the corpus (anticorpus).
   A pattern P is called a maximally general distinctive pattern (MGDP) if it is distinc-
tive (with at least a specified minimum ∆(P )), and if there does not exist a more general
(subsuming) pattern that is also distinctive. Thus the set of all MGDP is the top border of
the virtual pattern subsumption taxonomy such that all patterns above the border are not
distinctive.
   Patterns are ranked primarily by distinctiveness (Equation 1). Additionally, for every
distinctive pattern the confidence of the association rule P ⇒ ⊕ is computed. This is the
probability of the class given the pattern:

                                                c⊕ (P )
                                                  def
                                      p(⊕|P ) =         ,                                     (2)
                                                 c(P )

                                                   6
where c(P ) is the piece count of pattern P in the entire piece collection, which may differ
from the quantity c⊕ (P ) + c (P ) due to overlap between classes.

3     Results

For each category hyperarea, area, song hypertype, and song type the MGDP set is
found by in turn considering each class as the corpus ⊕ and all remainding classes as the
anticorpus   . For example, for the area category, considering the class chania, the classes
rethymno to syria (see Figure 1) are taken together as the anticorpus.
    For all experiments, melodic intervals patterns with ∆(P ) ≥ 3 were sought. The mini-
mum p(P |⊕) used was 20% of the corpus. Also, very sparse classes syria, kalanda, male-
visiotis, and epic song were not considered as positive corpora as it is possible for even
the low 20% threshold to be satisfied by just a single piece.
    Table 1 shows a subset of patterns found, presenting 3 patterns for each category that are
highly distinctive with ∆(P ) ≥ 5, with piece count c⊕ (P ) ≥ 5. Several patterns also have
high confidence (Equation 2). In the following we discuss one pattern from each category.
    In the dance song hypertype the pattern [+4, −4] is found in all types of dances, through-
out the island. The instances of this pattern are found in fast isochronous passages. Four
instances, taken from different types of dances are shown in Figure 2 (top row). In most
instances the second note occurs on a relatively strong beat.
    In the west hyperarea, an interesting, frequent, and high confidence pattern [−4, +2, −3]
was found. The pattern occurs in nearly 25% of the west pieces, and does not occur at all in
east pieces. This pattern is also distinctive, but to a lesser degree, in the rethymno area.
In diatonic terms, this represents a drop of a M3, an ascending M2, followed by a descending
m3, for instance, E-C-D-B or A-F-G-E. Figure 2 (middle row) shows seven instances of the
pattern: the second and third are found in the same song, but with a different surrounding

                                              7
class ⊕             pattern P           c⊕ (P )/n⊕   p(⊕|P )   ∆(P )

                                           area
           lassithi         [−7, +4]                 7/35        0.88     14.2
           lassithi [−2, −1, +1, −1, +1, −1]         7/35        0.88     14.2
           rethymno       [−4, +2, −3]               10/29       0.71      6.6

                                       song type
           syrtos               [−5, −2]             5/17        1.00      ∞
           kontylies            [+3, −7]             5/17        0.83     25.0
           kontylies          [+1, +2, +3]           5/17        0.83     25.0

                                       hyperarea
           west               [−4, +2, −3]           14/64       1.00      ∞
           east           [+1, +2, −2, +2, −2]       13/47       0.81      5.9
           east           [−2, +2, −2, −1, +1]       11/47       0.79      5.0

                                   song hypertype
           dance                [+4, −4]         16/51           0.94     19.1
           dance            [−4, +2, +2, −4]     11/51           0.92     13.2
           dance              [+4, +1, +2]       19/51           0.90     11.4

      Table 1: Patterns found in the Crete folk song corpus, organized by category.

Figure 2: Instances of the dance pattern [+4, −4], west pattern [−4, +2, −3], and syrtos
pattern [−5, −2].

                                            8
context. Thus the phrase is varied while this core pattern remains intact.
    For the syrtos song type, an interesting and high confidence pattern [−5, −2] was found.
The pattern occurs in no other song types. In diatonic terms, this pattern represents a
descending P4 followed by a descending M2, for instance, A-E-D or E-B-A. Figure 2 (bottom
row) shows five instances of the pattern. The rhythmic and metrical aspects of the instances
varies considerably, indicating the abstraction provided by the melodic interval pattern.

4     Conclusions

Though the amount of data is small (just over 100 pieces), some interesting findings have
emerged. First, in the hyperarea category, all highly distinctive patterns are within the
dance class (data not shown). Also, two maximally distinctive patterns (not occurring in
any other classes) were found, in the syrtos song type, and the west hyperarea.
    The method and knowledge base developed is extensible and readily admits adding fur-
ther Crete folk songs. In addition, a novelty here was the allowance for pieces to participate
in multiple classes.
    In this experiment the MGDP algorithm was applied by taking each class in turn as
the corpus; alternatively it is entirely possible to group classes together (e.g., wedding and
lament) to form a compound corpus. Concurrently, the data mining algorithm could easily
be modified to use a background song hierarchy so, for example, the pattern [−4, +2, −3]
would not have been reported twice (in both area and hyperarea) but only the most
distinctive occurrence.
    It is clear that more songs are needed, especially to more thoroughly populate the classes
of the area and type categories. Even with a threshold of 20%, for small classes it is too
easy to reach this threshold, and this is borne out by the preponderance of type patterns
from small classes such as rizitiko, city song, sousta, and lament (data not shown: in

                                              9
Table 1, the total output was first filtered to retain only the frequent distinctive patterns).
   There are some general considerations to bear in mind with the analysis of Cretan folk
music (and folk music in general): first, as mentioned above, transcription of the songs
deviates from the live performance [6]. Second, these songs are found in various villages and
places, with minor variations; the transcriptions available to us did not notate the variations.
Third, there are songs that go beyond the boundaries of their origin, and they can be found in
other areas with possible variations. Fourth, a song of the table might become at a different
instance a song of the road, and even a dance song [7] if different lyrics are sung. Finally,
in Cretan music especially, a lot of these songs are performed by various artists, and each
lyra artist has a unique style of playing, which in this case means adding small characteristic
turns and motives. Information about performer might be taken into account but would
require more data collection for distinctive pattern analysis.

Acknowledgments

This work was partially funded by the John Latsis Foundation (www.latsisfoundation.org),
through the project grant entitled “Comparative computational analysis of Cretan music”,
in the call “Meletes 2009”. Many thanks also to N. Poulakis, R. Pylarinos, and A. Pikrakis
for encoding and discussions on the corpus.

References

[1] G. Amargiannakis. Gia mia morphologia tou ellinikou dimotikou tragoudiou (towards
   a morphology of the greek traditional song). Technical report, Department of Music
   Studies, University of Athens, 1994.

                                              10
[2] E. Andreoulakis and S. Petrakis. Music and Mantinades in Crete. Philippos Nakas Music
   Publications, Athens, Greece.

[3] S. Baud-Bovy. Mousiki Katagrafi stin Kriti 1953-1954 (Music recordings in Crete 1953-
   1954). Folk Music Archives Melpo Merlie, Athens, Greece, 2006.

[4] D. Conklin.    Discovery of distinctive patterns in music.    Intelligent Data Analysis,
   14(5):547–545, 2010.

[5] D. Conklin. Distinctive patterns in the first movement of Brahms’s String Quartet in C
   Minor. Journal of Mathematics and Music, 4(2):85–92, 2010.

[6] M. Dragoumis. I Paradosiaki mas Mousiki (Our Traditional Music) Vol. 2. Folk Music
   Archives Melpo Merlie, Athens, Greece, 2009.

[7] M. Rombou-Levidi. Psifides horou ston evro. In L. Droulia and L. Liavas, editors,
   Mousikes tis Thrakis: Mia diepistimoniki Proseggisi. Syllogos oi Filoi tis Mousikis, Evros,
   1999.

[8] G. Spuridakis and S. Peristeris. Greek Folk Songs. Academy of Athens Publications,
   Athens, Greece, 1968.

[9] E. Theodossopoulou. Oi skopoi stin Elliniki Paradosiaki Moysiki (Tunes in Greek Tradi-
   tional Music). PhD thesis, Department of Music Studies, University of Athens, Athens,
   Greece, 2003.

                                             11
You can also read