Birds Recognition using Pitch Frequency in Matlab

Page created by Beatrice Hansen
 
CONTINUE READING
Birds Recognition using Pitch Frequency in Matlab
International Journal of Innovative Information Systems
                          & Technology Research 9(3):50-57, July-Sept., 2021

              © SEAHI PUBLICATIONS, 2021 www.seahipaj.org                    ISSN: 2467-8562

        Birds Recognition using Pitch Frequency in Matlab

                                 Woko, Ovunda & Kabari, Ledisi Giok

                                Department of Computer Science,
            Ignatius Ajuru University of Education, Port Harcourt, Rivers State, Nigeria

ABSTRACT
Birds sing, whistle or chirp. Whether they sing, whistle or chirp, it is of great importance for researchers
to extract the pitch frequency of their vocalization to deepen knowledge on their uniqueness. Pitch is an
important feature of birds’ vocalization which scientists can learn much about their distinguishing
inhabitants. To employ pitch as a feature, extraction method is fundamental to the researcher to have
accurate pitch estimation. The present technology of Digital Signal Processing plays important role in the
vocal signal extraction of bird’s pitches. This paper aims to use digital signal processing techniques in
Matlab to extract pitch frequencies of bird’s sounds to recognize their uniqueness. In this research, the
sounds of 13 different birds were collected from online website. The audio sounds of the birds were
recorded in 3GPP format accessible in MATLAB in WAV format. The names of the birds, their sounds
and duration of the record respectively are tabulated. The sampling frequency of collected data is 48,000
samples per second. We adopt Feature-driven development (FDD)- a form of Agile software development
methodology for this work. Matlab programming version R2018a tool was used for implementation. The
implementation follows our proposed model for pitch frequency detection. Time, frequency, and time-
frequency, power spectrum signal techniques are used. We observe and recognize the uniqueness of birds
by theirs pitch frequencies.
Keywords: Pitch, Frequency, Recognition, Digital Signal Processing, MATLAB

1.     INTRODUCTION
There are about 10,000 to 13,000 bird species and 200 to 400 billion individual birds worldwide toady
(birdlist, 2021). However, the question about the number of bird species in the world depends on the
taxonomist. Identifying and classifying birds individually may not be a trivial task because no two birds
are the same. What feature can distinctly distinguish, detect and recognize birds? With advent of digital
signal processing using Matlab, the distinct detection and recognition of birds can be explored based on
distinct feature extract and analysis.
Bird pitch frequency recognition using digital signal processing involves feature extraction method,
feature classification and species detection (Briggs et al., 2012). Typically feature extraction method uses
time domain analysis, frequency domain analysis and time-frequency analysis to extract vocal features
from a bird data (Ghoraani, 2011). Feature extraction using a time domain analysis is very useful for
stationary data (Ghoraani, 2011; Vundavalli, 2016), however, it is not good because bird’s vocal data is
highly non-stationary. Feature extraction using frequency analysis is obliging for non-stationary signals as
well as stationary signal, and bird produces non stationary vocal signals. This analysis extraction method
is useful to a degree but due to the lack of temporal evaluation, the feature extraction is limited for non
stationary, whereas time-frequency analysis feature extraction method provide information about both
temporal and spectral features (Ghoraani, 2011; Starkhammar, 2015; Ghoraani, 2011). This reason leads
to selecting the time-frequency analysis for feature extraction.

                                                    50
Birds Recognition using Pitch Frequency in Matlab
Woko & Kabari ….. Int. J. Innovative Info. Systems & Tech. Res. 9 (3):50-57, 2021

Digital signal processing is a technique essential for a wide range of applications, from data science to
real-time embedded systems, and MATLAB products make it easy to use signal processing techniques to
explore and analyze time-series data, provide a unified workflow for the development of embedded
systems and streaming applications (Starkhammar, 2015). Exploring signal processing with MATLAB
products help one analyze signals from a range of data sources. We can acquire data, measure, transform,
filter, and visualize signals and apply signal processing tools to preprocess and filter signals prior to
analysis; explore and extract features for data analytics and machine learning applications; analyze trends
and discover patterns in signals; and visualize and analyze in time, frequency and time-frequency domain
characteristics of signals.
The analysis of birds vocal sounds have increased in the speech processing domain in the past few
decade. Much of the reported research has concentrated on the identification of bird species by
appearance. It is parochial to identify the bird variety directly by looking at it, because there are many
varieties of birds living around us. Trivial reported topic is the analysis of bird chirps from species of the
same bird. It is uncommon way to quantify difference between bird inhabitants by analyze the pitch
frequency of bird chirps or vocal sound because it require modeling, data acquisition, preprocessing,
extraction and pitch frequency recognition.
The paper presents a model for birds’ recognition using pitch frequency digital signal processing in
MATLAB to prove their uniqueness. We acquire the data set of vocal sounds of different birds. Develop a
model for processing the pitch. Import into MATLAB version R2018a and estimate the pitch frequency.
This work will greatly benefit zoologists ecological studies, researchers and socio-cultural community
about birds uniqueness by their pitch frequencies. .

2. Review of Related Work
The speech processing community in the last few years, has researched many issues in bird vocalizations,
especially species classification (Connor et al., 2012; Fagerlund , 2014; Trifa et al., 2008).
Babacan et al. (2013) evaluated pitch tracking on singing voice, with PRAAT and RAPT providing the
best determination of voicing boundaries.
Lachlan et al., (2013) developed Luscinia package widely used in the community to analyze bird song. It
provides pitch extraction but requires supervision. The Luscinia GUI allows the user to select elements
that require pitch tracking, but even when elements are carefully selected the exported pitch tracks may
not always match the spectrogram.
Meliza et al. (2013) developed Chirp, a tool which allows the user draw a mask on the spectrogram to
improve pitch estimation.
Hansson-Sandsten et al. (2011) analyzed male great reed warbler’s song based on its extracted syllables.
The analysis on the syllables extracted were done using time frequency analysis.

3. METHODOLOGY
3.1      Selection of Bird’s Sound
The data source of birds sound for this work was collected https://www.youtube.com/watch?v=WhRpW0
cVmds&t=143s. The birds’ sounds were downloaded, recorded, saved and used. The sound sources
acquired from the online sources should be laboratory generated or should be noiseless (Hansson-
Sandsten, et al., 2011) to avoid pollution feature extraction values. Noise removal algorithm may be
implemented to the sound acquired. Although denoising the sound with noise removal algorithms take
away the less amplitude feature extracted values and that may result in less efficiency and less accuracy
of the detection of the pitch.
In this research, the sound of 13 different birds were collected from online website (Kiddopedia, 2019) on
14th April, 2021. The audio sounds of the birds were recorded in 3GPP format and accessible in
MATLAB in WAV format. The table 1 shows the Birds’ names, sounds and duration respectively. The
sampling frequency of collected data is 48,000 samples per second. We adopt Feature-driven
development (FDD) - a form of Agile software development methodology for this work. Matlab
programming version 2018a tool is used for implementation.

                                                     51
Woko & Kabari ….. Int. J. Innovative Info. Systems & Tech. Res. 9 (3):50-57, 2021

Table 1. Birds names, sounds and duration

No    Name of Birds                    Chirping Duration   Sound
                                       in Seconds
1     Bald Eagle                                5
                                                             Bald Eagle.3gpp
2     Blackbird                                 5
                                                             Blackbird.3gpp
3     Bluejay                                   5
                                                             Bluejay.3gpp

4     Bluetit                                   5
                                                             Bluetit.3gpp

5     Budgie                                    5
                                                             Budgie.3gpp

6     Crane                                     5
                                                             Crane.3gpp

7     Crow                                      5
                                                             Crow.3gpp

8     Eagle                                     5
                                                             Eagle.3gpp

9     Falcon                                    5
                                                             Falcon.3gpp

10    Nightingale                               5
                                                             Nightingale.3gpp

11    Owl                                       5
                                                             Owl.3gpp
12    Pigeon                                    5
                                                             Pigeon.3gpp

13    Quail                                     5
                                                                     Quail.3gpp

Data Source: from https://www.youtube.com/watch?v=WhRpW0cVmds&t=143s

                                               52
Woko & Kabari ….. Int. J. Innovative Info. Systems & Tech. Res. 9 (3):50-57, 2021

    3.2 Pitch Detection Model
    Our model collects birds sound, filters, processes time domain, frequency domain, power spectral feature
    extract and correlate as shown in the figure1.

Bird sound           Filter             Time              Frequency            Power            Feature      Autoco
(Signal)             Signal             Domain            Domain               Spectral         Extraction   rrelatio
                                                                                                             n

 Signal 1            Signal 1           Signal 1            Signal 1            Signal 1        Signal 1     Autocorr
                                                                                                             elation 1

 Signal2             Signal 2           Signal 2            Signal 2            Signal 2        Signal 2     Autocorr
                                                                                                             elation 2

                         .                 .                     .                 .                .           .
     .
                         .                 .                     .                 .                .           .
     .
     .                   .                 .                     .                 .                .           .
     .                                                                                                          .
 Signal n            Signal n           Signal n            Signal n            Signal n        Signal n     Autocorr
                                                                                                             elation n
    Figure 1: Pitch Detection Model for Birds

    3.3       Detection of Bird Species mathematically
    Auto-Correlation
    Correlation is used to calculate the commonness between two variables. The correlation values will be in
    between +1 and -1. When the correlation co-efficient is nearer to ±1, the two variables will have high
    quantity of common elements. When the correlation co-efficient is nearer to 0, the commonness between
    them is very low.
    If the time series x(t) is correlated with another time series h(t).
    The resultant correlation value will be r(τ ), that is r(τ) = x(t) * h(t) ……………………(1)
    Where: t is the time series
             τ is time lag
          * is convolution factor
    Correlation of a matrix or function with itself at different lags in time is called Auto-correlation.

    If the time series x(t) is correlated with itself, then the resultant will be rx as
     rx = x(t) ∗ x(t − τ ) …………………………………………..(2)
    where: rx is the auto-correlation value
           x(t − τ ) is delayed time series signal by τ
           τ is time lag
             (*) is convolution factor

                                                            53
Woko & Kabari ….. Int. J. Innovative Info. Systems & Tech. Res. 9 (3):50-57, 2021

Applying auto-correlation to our data, means a violation the assumption of instance independence
because, the bird sounds recorded start at different instances. Therefore auto-correlation method helps to
rationalize and infer.
In our auto-correlation perfect multicollinearity is almost impossible. Perfect multicollinearity is getting
the auto-correlation value to exact ±1. Perfect multicollinearity is impossible. Though the sounds of a
same bird same but the electronic equipment used to reduce these sounds are different and also due to the
differences in specifications of components in different recorders, this condition occurs, even, when
source files are taken from the online database. The auto-correlated co-efficient must be nearer to ±1 in
order to say that the given audio signal is similar i.e. classified values are equal to a particular bird values
in our database. When the auto-correlated co-efficient is nearer to 0 (zero), the taken signal is not similar
to that of audio signal saved

4.   RESULTS AND DISCUSSION
The results of the birds sound in table 1 are generated in time domain, frequency domain, power spectral
respectively. The implementation follows the proposed model.

Figure 2. Plot of Pitch Frequency of Bald Eagle in Frequency Domain

                                                      54
Woko & Kabari ….. Int. J. Innovative Info. Systems & Tech. Res. 9 (3):50-57, 2021

Figure 3. Plot of Frequency normalization and Power Spectrum Bald Eagle

Figure 4. Plot of Pitch Frequency of Blackbird in Frequency Domain

                                               55
Woko & Kabari ….. Int. J. Innovative Info. Systems & Tech. Res. 9 (3):50-57, 2021

Figure 5. Frequency normalization and Power Spectrum of Blackbird

DISCUSSION
Closely observe the plots, the pitch frequency values of the birds sound are uniquely different. In the
power spectrum plot, Bald Eagle pitch frequency is about 0.13 while Blackbird is 0.06 dB as shown in
table 2. The implementation for the 13 different sample of birds gave different pitch frequencies but for
the sake of page consumption we captured, Bald Eagle are Blackbird. This shows that no two birds are the
same.
Table 2. Birds Pitch Frequency difference in Power Spectrum (dB)
Name of Birds        Pitch Frequency in Power
                     Spectrum(dB)
Bald Eagle           0.13
Blackbird            0.06

CONCLUSION
The aim of this paper has been demonstrated by proposing a model for birds pitch frequency recognition
using digital signal processing techniques in Matlab to prove their uniqueness. The objectives are
achieved: data of vocal sounds of different birds were collected, model developed for processing pitch
frequency and power frequency spectrum implementation in MATLAB was carried. We have
demonstrated that birds can be recognition using pitch frequency using digital signal processing
techniques
   .
REFERENCES
1.    Birdlist (2021). List of Birds in the world. Retrieved 2021 from www.birdlist.org
1.    Briggs, F. (2012). Acoustic classification of multiple simultaneous bird species: A multi- instance
      multi-label approach. The Journal of the Acoustical Society of America, 131(6), pp. 4640–4650.
2.    Ghoraani, B. and Krishnan, S. (2011). Time–frequency matrix feature extraction and classification
      of environmental audio signals. IEEE transactions on audio, speech, and language processing,
      19(7), pp. 2197–2209.
3.    Vundavalli K. S. and Danthuluri V. S. R. S. (2016). Applied Signal Processing Blekinge Institute
      of Technology SE–371 79 Karlskrona, Sweden

                                                   56
Woko & Kabari ….. Int. J. Innovative Info. Systems & Tech. Res. 9 (3):50-57, 2021

4.    Starkhammar, J. and Hansson-Sandsten M. (2015). Evaluation of seven time
      frequency representation algorithms applied to broadband echolocation signals. Advances in
      Acoustics and Vibration, vol. 2015
5.    Connor, E. F. (2012). Automating identification of avian vocalizations using time- frequency
      information extracted from the gabor transform. The Journal of the Acoustical Society of America,
      132, 507–517
6.    Fagerlund, S., and Laine, U. K. (2014). New parametric representations of bird sounds for
      automatic classification. IEEE International Conference on Acoustics, Speech and Signal
      Processing (ICASSP) (pp. 8247–8251). IEEE.
7.    Trifa, V. M (2008). Automated species recognition of ant birds in a Mexican rainforest using
      hidden markov models. The Journal of the Acoustical Society of America, 123, 2424–2431.
8.    Babacan, O. (2013). A comparative study of pitch extraction algorithms on a large variety of
      singing sounds. In 2013 IEEE International Conference on Acoustics, Speech and Signal
      Processing (ICASSP) (pp. 7815–7819). IEEE.
9.    Lachlan, R. F. (2013). The progressive loss of syntactical structure in bird song along an island
      colonization chain. Current Biology, 23, 1896–1901.
10.   Meliza, C. D. (2013). Pitch-and spectral-based dynamic time warping methods for comparing field
      recordings of harmonic avian vocalizations. The Journal of the Acoustical Society of America, 134,
      1407–1415.
11.   Hansson-Sandsten M. (2011). Svd-based classification of bird singing in different time frequency
      domains using multitapers,” in Signal Processing Conference, 2011 19th European, pp. 966–970,
      IEEE, 2011.
12.   Kiddopedia (2019). Birds Names and Sounds Learn Bird Species in English. Retrieved 20 21 fro
      m https://www.youtube.com/watch?v=WhRpW0cVmds&t=143s

                                                  57
You can also read