FILTERING FAKE PRODUCT REVIEWS BY USING n GRAM APPROACH

Page created by Ken Singh
 
CONTINUE READING
Mukt Shabd Journal                                                                                      ISSN NO : 2347-3150

         FILTERING FAKE PRODUCT REVIEWS
            BY USING n GRAM APPROACH
                                                 Ms.N.Padmapriya,
                                                 Assistant Professor,
           Department of Computer Science and Engineering, SSM Institute of Engineering and Technology, Anna
                                           University,Dindigul, Tamil Nadu

                                                   A.Soundharya,
                                                     UG Scholar
           Department of Computer Science and Engineering, SSM Institute of Engineering and Technology, Anna
                                           University,Dindigul, Tamil Nadu

                                                     V.Subalakshmi,
                                                     UG Scholar
           Department of Computer Science and Engineering, SSM Institute of Engineering and Technology, Anna
                                           University,Dindigul, Tamil Nadu

                                                     R.Sugapriya
                                                     UG Scholar
           Department of Computer Science and Engineering, SSM Institute of Engineering and Technology, Anna
                                           University,Dindigul, Tamil Nadu

       Abstract--- As the trend to shop online is increasing day by day and more people are interested in buying the
       products of their needs from the online stores. This type of shopping reduces the shopping time and travelling
       time of the customers. Customers go to online store, search the item of their need and place the order. But, the
       thing by which people face difficulty in buying the products from online store leads to bad quality of the
       product. Customer place the order only by looking at the rating and by reading the reviews related to the
       particular product. Such comments of other people are the source of satisfaction for the new product buyer.
       Here, it may be possible that the single negative review changes the angle of the customer not to buy that
       product. In this situation, it might possible that this one fake review makes the loss and business. So, in order
       to remove this type of fake reviews and provide the users with the original reviews and rating related to the
       products, we proposed a Fake Product Review Monitoring and Removal System (FaRMS) which is an
       Intelligent Interface and takes the Uniform Resource Locator (URL) related to products of Amazon, Flipkart
       and Daraz and analyzes the reviews, and provides the customer with the original rating. It is a unique quality
       of the proposed system that it works with the three e-commerce Websites and not only analyzes the reviews in
       English but also the reviews written in Tamil and Hindi. The proposed work achieved the accuracy of 87% in
       detecting fake reviews, of written in English by using intelligent learning techniques which is greater than the
       accuracy of the previous systems.

Volume IX, Issue VII, JULY/2020                                                                                Page No : 2132
Mukt Shabd Journal                                                                                              ISSN NO : 2347-3150

                                                             1.INTRODUCTION
       There are different ways to shop like people can buy a specific thing of their need by going to a store or mall. In this
       style of shopping the seller gives the feedback of the product, they do not know whether he/she is giving a fake
       feedback or original. Because, it is upon seller honesty, how much the seller is true in his/her words and people have
       to carefully examine the product because people do not have any
       other option in examining the product. If customer do not pay attention in buying that product then it may be proved
       a waste for them. On the other hand, nowadays source of shopping has been changed. People can buy the products
       from the online stores of different brands. Here, customer have to place the order without seeing and examining the
       original product. Every customer reads the reviews and buy the product. Therefore, we are dependent on the reviews
       about the product. These reviews may be the original or fake. The customer wants to buy an original and reliable
       product, which is possible only when you get the original feedback related to that product. can find the original
       feedback and rating related to a specific product .Then, it is the source of satisfaction and reliability for customer. In
       the proposed technique, the reviews related to a product for which the URL is given are extracted. After it, the system
       finds the fake reviews and finally by analyzing these reviews system finds the original reviews of the product. Previous
       researches detect fake reviews using different approaches including identification address, opinion mining and
       sentiment analysis, machine-learning approach. There are many researches available (English, Tamil, Hindi ) detect
       fake reviews for Therefore, we have proposed Fake Product Review Monitoring and Removal System (FaRMS) in
       which a customer can get the best possible item from the online store in a short time and with the original reviews
       associated with that product. This system gives you the original words of people related to the product with genuine
       reviews. Some popular products can get hundreds of reviews at some large merchant sites and FaRMS gives you the
       promising reviews by filtering fake reviews and then customer can decide whether they want to buy or not. Fake
       review monitoring system focuses on detecting spam and fake reviews by using sentimental analysis removes the
       reviews which have curse and vulgar words. In the proposed system web crawler is used to scrap the data on the
       Website. In the preprocessing, the data is converted into the required format and then the fake reviews are removed
       from the mixture of original and spam reviews. Fake reviews are detected by the Fake Review Detector.

                                                    2.LITERATURE SURVEY

       Anjay K S., [1] In this paper, they proposed a framework to detect fake product reviews or spam reviews by using
       Opinion Mining. The Opinion mining is otherwise called as Sentiment Analysis. In sentiment analysis, they try to
       figure out the opinion of a customer through a piece of text. They first take the review and check if the review is
       related to the specific product with the help of Decision tree and use Spam dictionary to identify the spam words in
       the reviews. In Text Mining they apply several algorithms and on the basis of these algorithms we get the specific
       results.

       Rakibul Hassan., [2] Decision making on online products for purchase was mostly depends on reviews given by the
       users. Hence, for their own interests, opportunistic individuals or groups try to manipulate the product reviews. To
       detect fake online reviews, this paper introduces some semi-supervised and supervised text mining models as well as
       it compares the efficiency of both techniques on dataset containing hotel reviews.

Volume IX, Issue VII, JULY/2020                                                                                         Page No : 2133
Mukt Shabd Journal                                                                                            ISSN NO : 2347-3150

       Nidhi A. Patel., [3] Based on reviews or opinions that are written by others based on their experiences helps the
       customers to take decision directly. In this competitive world any person can write anything, this raise the number of
       fake reviews. Various companies are hiring people to write fake positive reviews about their services or products or
       unfair negative reviews about their competitors' services or products. Hence we need a system to detect such fake
       reviews and remove them. In this paper we discuss various supervised, unsupervised and semi supervised data mining
       techniques for fake review detection based on different features.

       Jiandun Li., [4] In this paper, how to distinguish whether a review is fraudulent or a reviewer is a spammer has long
       been studied, but the question of general review pattern mining is still open. In this paper , they model online product
       review systems into bipartite networks and adopt a network technique, called the weighted motif to uncover underlying
       reviewing patterns.

       Anusha Prabakaran., [5] In this paper, a statistical credibility scoring mechanism to identify spam reviews. It consists
       of three components: detection of duplicate reviews, detection of anomaly in review count and rating distribution, and
       detection of incentivized reviews. These three methodologies complement each other to effectively indicate the
       credibility of product reviews without requiring significant computational resources. It can aid data mining and online
       spam filtering systems to filter out spam product reviews and refine product rankings.

                                                     3. PROPOSED SYSTEM

       In this proposed system, the sentences that are not related with the quality of a product such as customer service or
       sentence related to the. In this paper the preprocessing is done by Support Vector Machine (SVM). First of all it
       removes the comments which neither is nor related with the quality of the product. Second stage describes the weights
       of the reviews based on the votes. The final stage calculates the overall ranking of the product. The ranking score is
       calculated by the relevance of the review with quality of the product, review content, and posting date of the review.
       In the evaluation process they use two measures to quantify effectiveness of the ranking model which are as following:
       correlation between the ranking method and the Amazon’s rank and second is the Mean Average Precision (MAP),
       which is a very commonly used technique for Evaluating ranking accuracy. As this system is finding the fake reviews
       by using the only two properties of the reviews but as per the future work describes in the paper more properties can
       be used to find out the fake reviews more accurately.

Volume IX, Issue VII, JULY/2020                                                                                       Page No : 2134
Mukt Shabd Journal                                                                                          ISSN NO : 2347-3150

                                Reviews

                     Data Pre processing

                        Review Analysis

                           Sentiment
                          Classification

                                Results

                                                     Figure 1 System Architecture

                                                       Module Description
       Admin : This module describes about the admin of this project. Here administration is the area of the Web store that
       is accessed by the merchant to manage the online shop. In general a store manager is able to add and edit products,
       categories, shipping and payment etc. Admin is the person who maintains the whole system. Admin updates the new
       products of the branded company.

       User Registration and Login : To search and rate any product in this application all user must registered with the
       system. User must register with this application for access the product list uploaded by the admin. User registers in
       this application with their details such as Name, gender, age, mobile number, email, username and password. After
       successful registration user able to login into the system then search for a product and rate any product based on the
       satisfaction they realized in that product.

Volume IX, Issue VII, JULY/2020                                                                                     Page No : 2135
Mukt Shabd Journal                                                                                              ISSN NO : 2347-3150

       Product Upload : In this application admin can only add new product details to search and rated by user. Admin need
       to login into this system by providing his/her username and password, after successful login admin can add the new
       product details such as product name, product ID, Quantity of product and cost, description and the image of the
       particular product. These details are important to search by the user. The products are search by their name and/or
       which category the product belongs to.

       User Rating : After successful login user can search for a particular product and rate the product. The rating is based
       on the 5 Star Rating. If the user give 5 star to a product it’s 100% satisfied the user, and if any user give 1 Star for a
       product then it’s not up to user’s expectations. Based on the user rating the product is recommended to other user in
       the same category or in same age group. This rating is very important for the system to maintain the high rated and
       mostly liked products and recommend it to other user of this application.

       Recommendation : Three kinds of methods were proposed to compute better recommendation algorithms. They are
       instance selection, time window and ensemble learning.A set of dynamic features are used to describe users multi-
       phase preferences.

                                               Annotate reviews

                                                                     Yes

              Review                               Acceptab
             collection                               le
                                                                     No

                                              Calculate Accuracy
              Dataset
            Partitioning

                                            Run on Test Data sets

             Sentiment
             Analysis
                                                 Build Model

            Map sentiment
              with trained                        Classify using
              data having                       identified vectors
            fakeness labels

                                                 Modify Vectors

Volume IX, Issue VII, JULY/2020                                                                                         Page No : 2136
Mukt Shabd Journal                                                                                               ISSN NO : 2347-3150

                                                Figure 2 Module Interface Diagram

       Fake view Detection : Some challenging e-shopping website can give fake feedback to the original website through
       some persons. To achieve popularity in internet marketing, these activities can be done. To find the fake review of the
       products user’s IP address are stored at every product rating. Rating rate will be cancel if particular user’s IP address
       repeated with different rating for the same product. This system automatically gives the notification .

                                                     4. Results and Discusssion

       In this chapter , we discuss the various output results.

       Home Page :

       Admin Login :

       Add Product :

Volume IX, Issue VII, JULY/2020                                                                                        Page No : 2137
Mukt Shabd Journal                ISSN NO : 2347-3150

       View Product :

       Customer View :

       Purchase Report :

       Rating and Suffusions :

Volume IX, Issue VII, JULY/2020        Page No : 2138
Mukt Shabd Journal                ISSN NO : 2347-3150

       User Registration :

       User Login :

       Product View :

       Display the detail :

Volume IX, Issue VII, JULY/2020        Page No : 2139
Mukt Shabd Journal                ISSN NO : 2347-3150

       Rating Suggestion :

       Purpose :

       Rating :

       Review the Products :

Volume IX, Issue VII, JULY/2020        Page No : 2140
Mukt Shabd Journal                                                                                           ISSN NO : 2347-3150

       Fake Review Message :

       Reviewing Turn over Messages :

                                                            5. Conclusion

       In the proposed work, dataset is developed that contains Urdu and Roman Urdu reviews. It is difficult to detect fake
       reviews by yourself. So, n-gram approach is used to detect fake reviews for multiple languages. It is observed that the
       text categorization with SVM classifier is best approach for the detection of fake reviews. Now a days, as the
       technology is growing day by day and there are so many Websites and applications available in the online market by
       which seller can sell their products and on that products there are millions of reviews available. There are some
       organizations posting fake reviews for the products of the seller in order to increase or decrease the rating of the
       products. Therefore, the system is proposed that detects the fake reviews in multiple languages including English,
       tamil, and Hindi, classify the reviews in genuine.

Volume IX, Issue VII, JULY/2020                                                                                      Page No : 2141
Mukt Shabd Journal                                                                                        ISSN NO : 2347-3150

       References

       [1] A. Sinha, N. Arora, S. Singh, M. Cheema, and A. Nazir, “Fake Product Review Monitoring Using Opinion
       Mining,” vol. 119, no. 12, pp. 13203– 13209, 2018.

       [2] Torbet, Georgina. “U.S. Customers Spent over $6 Billion on Black Friday Purchases.” Digital Trends, Digital
       Trends, 25 Nov. 2018, www.digitaltrends.com/ web/shopping-totals-blackfriday/.

       [3] Sterling, Greg. “Study Finds 61 Percent of Electronics Reviews on Amazon Are 'Fake'.” Marketing Land, 19 Dec.
       2018, marketingland.com/study-finds-61-percent-of electronics-reviews-on-amazon-are-fake 254055.

       [4] K. Khan, W. Khan,A. Rehman, A. Khan, Asfandyar. Khan, A. Ullah Khan, B. Saqia, "Urdu Sentiment Analysis,"
       (IJACSA) International Journal of Advanced Computer Science and Applications, Vol. 9, No. 9, 2018.

       [5] A. Mukherjee, B. Liu, and N. Glance, "Spotting Fake Reviewer Groups in Consumer Reviews," 2012.

       [6] A. Mukherjee, V. Venkataraman, B. Liu, and N. Glance, “What Yelp Fake Review Filter Might Be Doing?,” Aaai,
       pp. 409–418, 2013.

       [7] Z. Wang, Y. Zhang, and T. Qian, “Fake Review Detection on Yelp Dataset and features,” pp. 1–6.
       [8] S. Xie, G. Wang, S. Lin, and P. S. Yu, “Review spam detection via temporal pattern discovery,” p. 823, 2012.

       [9] C. Paper, “Mining millions of reviews : A technique to rank products based on importance of reviews Mining
       Millions of Reviews : A Technique to Rank products Based on Importance of Reviews,” no. November, 2015.

       [10] V. K. Madhura N Hegde, Sanjeetha K Shetty, Sheikh Mohammed Anas, “Fake product review monitoring,” Int.
       Res. J. Eng. Technol., vol. 05, no.06, p. 4, 2018.

Volume IX, Issue VII, JULY/2020                                                                                   Page No : 2142
You can also read