Research Article - (2023) Volume 12, Issue 3

Application of Temporal Sentinel 1 SAR Data for Multiple Crop Type Classification in a Command Area
Sobhan Mishra*, Annie Maria Issac, Syama S Rao, Ronald Singh, P V Raju and V V Rao
 
Department of Geography, Allahabad University, Allahabad, India
 
*Correspondence: Sobhan Mishra, Department of Geography, Allahabad University, Allahabad, India, Email:

Received: 12-Mar-2020, Manuscript No. JGRS-23-3617; Editor assigned: 18-Mar-2020, Pre QC No. JGRS-23-3617(PQ); Reviewed: 01-Apr-2020, QC No. JGRS-23-3617; Revised: 24-May-2023, Manuscript No. JGRS-23-3617(R); Published: 21-Jun-2023, DOI: 10.35248/2469-4134.23.12.301

Abstract

Assessment of cropped area during kharif season is a difficult task due to the presence of cloud, cloud shadows and haze. So microwave datasets provide a good alternative as they have cloud penetration capacity. But deriving crop related information from microwave datasets is difficult task as it is subjected to different factors like phonological stage of the crop during satellite image acquisition, presence of speckle, polarization and the classifier used. In this study suitable filter polarization and classifier is identified by systematic analysis of time series, back scatter values derived from Sentinel 1 Synthetic Aperture Radar (SAR) data. As per the study, for the selected study area and time period, Sentinel 1-SAR images subjected to speckle removal by Intensity Driven Adaptive Filter (IDAN) filter proved to perform well in classification as against other filters. The time series of speckle removed VH polarization images classified using Random Forest classifier gave an accuracy of 45 percent in classifying paddy, nonpaddy and fallow.

Keywords

Speckle; IDAN filter; Polarization; Random forest classifier

Introduction

Mapping of major crop types with its spatial distribution is an important step for crop monitoring and yield estimation. The multispectral datasets for single and multiple dates from different satellite platforms were used in the identification of crop extent, its condition in different studies around the world. Cloud cover possesses a challenge on the availability of time series optical datasets which is very much necessary for the identification of crop types. In cloud cover conditions microwave datasets becomes very useful for monitoring crop types and its extent. Crop type information can be obtained from microwave datasets by using single scene or time series images. Time series datasets used to give a better degree of clarity for mapping out cropped areas. Temporal variation in backscatter coefficient (σ0) of Paddyfrom showing to vegetation is very predominant as paddy at the time of showing shows least backscatter due to ponded water and when the crop is established, it shows higher backscatter. The variation in the temporal trendsof backscatter values over Paddy, Non-Paddy and Fallow lands makes these three classes separable [1]. The back scatter obtained was correlated with crop age, its height and its biomass to identify different crop types. Microwave datasets are affected by speckle, which is corrected at the time of preprocessing. In order to extract information it becomes necessary to reduce speckle. Speckle is a salt pepper appearance that occurs due to interference between two waves reflected from scattering over terrain surface. Image filtering is a procedure in which the original pixel values are replaced with some values computed by using neighboring pixel values. There are two types of filter Non adaptive and adaptive filters. Non adaptive filter use same set of weights to smooth the entire image (Mean, median and mode) on the other hand adaptive filter use weights based on degree of speckle in the image (Lee, frost, Gamma, IDAN etc.). The image smoothing depends more on local statistics for adaptive filter, for which they preserve more details than non-adaptive filter.

Speckle corrected backscatter values generated from different polarization (HH, HV, VV and VH) exhibit similar temporal trends; the range of values over the same land cover varies depending on the polarization. It was found that time series backscatter values generated from HV polarization C band SAR images gave superior separabilityfor identifying crop types. Different studies were carried out by using different microwave frequencies with respect to biomass of the plant. The accuracy of in the identification of cropped area with its type from SAR datasets depends on polarization and frequencies.

Higher the frequency of microwave, lesser is the penetration power of microwave with respect to biomass and lower the frequencies higher the penetration of microwave with respect to biomass. Some studies were conducted in which multi frequencies (Ka, Ku, X, C, S and L) microwave datasets were used for identification of paddy fields and it was observed that L and S frequencies showed better correlation with respect to variation of biomass of crop [2]. With the advent of machine learning approaches the accuracy of classification of microwave images for obtaining crop type information had improved to a very high level. Classifying algorithms like maximum likelihood, gausian naive bayes, support vector machine, random forest, decision tree etc. are employed in satellite image classification along with training samples derived in conjunction with ground information.

This study aims for identification of suitable filters that can be used during preprocessing workflows for getting better distinction between major crop types and its spatial distribution. Importance was also given on selection of suitable machine learning classifier [3]. Sentinel 1 datasets were used for this purpose so importance was also given on selection of suitable polarization and dates for analysis purpose.

Materials and Methods

Study area

The Narayanpur irrigation system is a part of Krishna river basin, which falls in Gulbarga and Raichur district of Karnataka State as seen in Figure 1. It lies between 77°05’ to 76°77’ east longitude and 16°34’ to 16°6’ N latitude. The meteorology of the study area is, it experiences an annual precipitation of 711 mm and an annual average temperature of 27°centigrade. The maximum temperature in the region ranges from 39°centigrade and minimum temperature is about 22° centigrade. The system irrigates is expected to irrigate a maximum of 3,84,997 Ha [4]. The area has predominantly black cotton soil and red soil which is suitable for cultivation of paddy, cotton, legumes and millets.

JGRS-Location

Figure 1: Location of narayanpur irrigation system.

Datasets used in the study

Satellite datasets: The Sentinel 1 SAR images for the months of June to November of the year 2018, having 12 days temporal frequency were used in the study. Sentinel 1-SAR provides datasets in two polarization VV (Vertical Send And Vertical receive) and VH (Vertical send and Horizontal receive), with a spatial resolution of 10 m. Mostly Ground Range Detection (GRD) datasets in Interferometry Wide swath (IW) mode products were used in the study.

During the period of June to November 2018 Sentinel 1-SAR datasets were acquired over Narayanpur irrigation system for 14 dates [5]. Of the 14 dates, 3 dates were selected for analysis based on thephonological development of the crop through the season. Datasets used for analysis are illustrated in Figure 2.

JGRS-VH

Figure 2: VH and VV layerstacked images for Sentinel 1- SAR.

Ancillary datasets : Ancillary datasets like command area boundaries digitized from high resolution datasets and sub basin boundaries obtained from terrain processing of digital elevation models were used in the study (India-WRIS).

Ground truth datasets:The training samples for classification was derived in conjunction with ground truth information collected by using hand held Global Positioning Devices (GPS) in Narayanpur irrigation system during 18th, 19th and 20th September 2018. In this process agricultural field was geo-tagged and crop related information like crop types, its irrigation source, stage of growth and its date of sowing information’s were collected [6]. Around 150 points were collected. Distribution of ground truth points throughout the study area is illustrated in Figure 3.

JGRS-groundtruth

Figure 3: Location of groundtruth sample in narayanpur irrigation system.

The methodology for identification of paddy and non-paddy cultivated area from microwave datasets is illustrated in Figure 4. The steps involved are:

• Identification of suitable dates.

• Pre-processing of microwave datasets and identification of suitable filters.

• Identification of suitable polarization.

• Identification of suitable classifiers.

• Accuracy analysis by computing kappa statics.

JGRS-Flow

Figure 4: Flow chart for preprocessing and classification of Sentinel 1 SAR data.

Identification of suitable dates

Temporal backscatter values (σ0) profile was generated by using ground information collected, for both VH and VV polarization data for the datasets acquired during the period of June to November 2018. As observed in Figures 5 and 6 temporal backscatter coefficient profile for paddy, non-paddy and fallow profiles showed noticeable variations in the month of August [7]. Selecting the temporal backscatter values profile for entire season for paddy and non-paddy identification will complicate the analysis, Hence dates in month of August was selected.

JGRS-Temporal

Figure 5: Temporal profiles of backscatter values for VH polarization for paddy and non-paddy crops.

JGRS-profiles

Figure 6: Temporal profiles of backscatter values for VV polarization for paddy and non-paddy crops.

Preprocessing of microwave datasets

Preprocessing of Sentinel 1 SAR datasets was done with help of SNAP toolbox. Radiometric, geometric and speckle filter was applied on the datasets to obtain backscatter values of individual dates for data sets in both the polarization [8]. In radiometric correction sigma naught backscatter values (σ0) were calculated.

Then geometric correction was carried out to avoid geometric distortions, which is caused due to geometry of sensor and shape of the objects. Filters are applied on the derived backscatter values for removal of speckles. There are many filters available for speckle removal. On application of filter, the distribution of the backscatter values is altered making the image smooth, removing the salt and pepper appearance.

There are different types of filter for removing speckle. The filters used in this study are boxcar, frost, gamma, median, lee sigma and IDAN. Boxcar averaging is a data treatment method that enhances the signal to noise of a signal by replacing the group of data points with its average. Frost Filter is an exponenti ally damped circularly symmetric filter that uses localstatistics [9]. The pixel being filtered is replaced with the value calculated based on the distance from the filter center, the damping factor and the local variance. Gamma filter is used to reduce speckle while preserving edges in radius image. The filter assumes that the data is gamma distributed and the pixel is replaced with value computed locally. Median filter smooth’s the image, while preserving edges larger than kernel dimensions. It replaces each center pixel with the median value within the neighborhood specified by the filter size. Lee sigma filter utilizes the statistical distribution of digital number values within the moving window to estimate the pixel of interest. It is based on the assumption that mean and variance of the pixel of interest is equal to the local mean and variance of all pixels within a selected moving window [10]. IDAN filter is Intensity driven adaptive neighborhood filter is atechnique in which the neighborhood is formed by using region growing technique driven exclusively by intensity image information.

The distribution of backscatter values in VV and VH polarization for a date was compared to understand the effect of different filters on the data. A time series layer stack of backscatter values in VV and VH polarization was subjected separability analysis to understand the suitability of filter for paddy, non- paddy and fallow area identification. The separability analysis was carried out to find out the degree of similarity and dissimilarity between the classes used for classification [11]. The seperability is quantified in terms of Transformed Divergence and Jeffries Matusita distance. Transformed Divergence gives an exponential decreasing weight for increasing distances between classes. Jeffries Matusita distance, is a function of Separability that directly relates to the probability of how good a resultant classification will be.

Identification of suitable polarization for classification of paddy, non-paddy and fallow land

Temporal backscatter profile for paddy, non-paddy and fallow classes was obtained from time series backscatter images in both VV and VH polarisation. Using 150 points for paddy, nonpaddy and fallow derived from the ground information, statistical parameters like mean, maximum, minimum, standard deviation and range was computed for all the three classes from both VV and VH data sets. Thepolarisation with least overlap among classes was selected for classification purpose.

Classification by machine learning algorithms

Three different machine learning approach was used for identification of paddy, non-paddy and fallow classesnamely gaussian naive bayes, decision tree and random forest.

Gaussian Naive Bayes employs Bayesian networks which are structures for representing the probabilistic relationship among large number of variables and doing probabilistic inference with that variable. This classifier mostly dealt with discrete variables, for continuous variable, a conditional Gaussian network paradigm is required without discretizing them [12]. The decision tree classifier is characterized by the fact that an unknown sample is classified into successive one or several decision trees for processing. There is a unique relationship between the string and the decision tree. The tree is encoded as a string of symbol such that there is an unique relationship between string and decision tree. The string is decoded in computer and points are set up to define appropriate classification path for each data sample. Random Forest classifier is an ensemble learning method for classification and regression that operates by constructing multiple decision trees at training time outputting the class that is mode of classes (classification) or mean prediction (regression) of individual trees. Random decision forest correct intermediate decision trees by over fitting their training sets.

Classification was done on time series VH backscatter image using the above discussed classifiers and accuracy of classification was assessed using Kappa statistics.

Results and Discussion

Selection of suitable filter

Statistical analysis: On application of filters the distribution of the backscatter values modified. The statistics of the backscatter values in VV and VH polarization is summarized in Table 1. As observed, on application of filters, standard deviation increased and the range of back scatter values decreased making the image more suitable for classification [13]. As illustrated in Figures 7 and 8, the IDAN reduced the range of backscatter values with higher standard deviation, making it best suitable filter with respect to statistical analysis.

Filters VH polarisation  VV polarisation
Min Max Mean Std. dev Min Max Mean Std. dev
No filter -33.58 12.53 -18.32 2.87 -29.99 26.17 -10.34 3.64
Boxcar -27.32 3.31 -7.62 9.11 -24.88 22.47 -4.27 5.44
Frost -27.38 0 -7.58 9.1 -25.14 22.7 -4.25 5.44
Gamma -27.34 7.205 -7.58 9.1 -25.56 21.45 -4.25 5.43
Lee sigma -26.33 8.64 -7.56 9.06 -24.08 21.59 -4.23 5.38
Median -27.79 0.23 -7.6 9.13 -25.49 22.09 -4.28 5.47
IDAN -26.13 0 -7.62 9.15 -23.2 16.18 -4.34 5.48

Table 1: Statistical range of backscatter values VH and VV polarization.

JGRS-Back

Figure 7: Back histogram distribution for image without filter and with IDAN filter for VV polarization.

JGRS-histogram

Figure 8: Back histogram distribution for image without filter and with IDAN filter for VH polarization.

Separability analysis: Separability analysis was carried out for the temporal back scatter profiles generated for the selected dates filtered by the six selected filters. The transformed divergence and jeffries-matusita distance was estimated and summarized in Table 2. It can be observed that the backscatter values derived from Intensity Driven Adaptive Filter (IDAN) filter was observed to give high separability values in both transformed divergence and jeffries matusita distance.

Filters Transformed  divergence Jeffries Matusita distance
Paddy Non-Paddy Fallow Paddy Non-Paddy Fallow
Boxcar Paddy 0 1553 1886 0 1206 1303
Non-paddy 1553 0 1902 1206 0 1336
Fallow 1886 1902 0 1303 1336 0
Frost Paddy 0 1469 1862 0 1179 1265
Non-paddy 1469 0 1757 1179 0 1287
Fallow 1862 1757 0 1265 1287 0
Gamma Paddy 0 1173 1418 0 710 1076
Non-paddy 1173 0 1558 710 0 1058
Fallow 1418 1558 0 1076 1058 0
Lee sigma Paddy 0 1937 1501 0 1308 1197
Non-paddy 1937 0 1992 1308 0 1395
Fallow 1501 1992 0 1197 1395 0
Median Paddy 0 1632 1017 0 1149 972
Non-paddy 1632 0 1866 1149 0 1272
Fallow 1017 1866 0 972 1272 0
IDAN Paddy 0 1914 1995 0 1345 1387
Non-paddy 1914 0 2000 1345 0 1410
Fallow 1995 2000 0 1387 1410 0

Table 2: Seperability Values computed by transformed divergence and Jeffries Matusita distance for different filters.

Selection of suitable polarization for generation of time series backscatter values for classification

Box plot was generated for backscatter values of both VV and VH polarization for multiple dates selected as illustrated in Figures 9 and 10. It was observed that values in VH polarisation show lesser degree of overlap between paddy, non-paddy and fallow classes [14].

Hence, backscatter values in VH polarization was found better suited for classification as compared to back scatter values in VV polarization.

JGRS-boxplot

Figure 9: VH boxplot showing statistical variation.

JGRS-statistical

Figure 10: VV boxplot showing statistical variation.

Classification and accuracy assessment

Classification was carried out using three machine laerning algorithms namely random forest, gaussian naive bayes and decision Tree, on temporal back scatter profile in VH polarization subjected to IDAN filter. Training samples were generated from 10% of the ground truth points as illustrated in Figure 11. The crop maps generated are as seen in Figures 12-14. It was observed that the paddy area was over estimated by both Gaussian Naive Bayes and decision tree classifier.

JGRS-profile

Figure 11: Temporal backscatter profile for VH polarization for paddy, non-paddy and fallow.

JGRS-Crop

Figure 12: Crop map generated by Gaussian Naive Bayes classifier.

JGRS-map

Figure 13: Crop map generated by decession tree classifier.

JGRS-random

Figure 14: Crop map generated by random forest classifier.

The accuracy of classification was computed by estimation of Kappa statistics was computed for the maps generated from all the three classifier [15]. It was found that random forest classifier showed higher degree of classification accuracy with a Kappa value of 45 whereas Gaussian Naive Bayes and decision Tree gave a Kappa value of less than 25.

Conclusion

This study aimed for identification of suitable preprocessing methods and polarization of microwave data for identifying paddy, non-paddy and fallow areas using Sentinel 1 SAR datasets for Narayanpur irrigation system. Focus was given to identify cultivated lands within four weeks of sowing of paddy and non-paddy crops. Only sowing dates were taken into consideration for analysis in this study. Sowing dates gives a better discrimination between paddy, non-paddy and fallow. Intensity driven adaptive filter was found suitable as it showed higher degree of separability between paddy, non-paddy and fallow lands. VH polarized SAR data was best suited in this study area as the range of backscatter values showed minimum overlap among the classes. Random forest classifier proved to be best suited for classification of the temporal VH backscatter profiles for identification of the three classes under consideration. For identification of the paddy and nonpaddy classes, selection of dates during the sowing period in place of season time series will reduce the complexity in classification. Availability of adaptive fi lters for preprocessing and machine learning algorithms for classification can be best utilized for automated generation of crop maps from microwave datasets during Kharif season in absence of optical datasets.

References

Citation: Mishra S, Issac AM, Rao SS, Singh R, Raju PV, Rao VV (2023) Application of Temporal Sentinel 1 SAR Data for Multiple Crop Type Classification in a Command Area. J Remote Sens GIS. 12:301.

Copyright: © 2023 Mishra S, et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.