Indian Journal of Data Mining
EISSN : 2582-9246
Published by: Lattice Science Publication (LSP) (10.54105)
Total articles ≅ 6
Articles in this journal
Indian Journal of Data Mining, Volume 2, pp 1-14; https://doi.org/10.54105/ijdm.c1618.051322
Safety on roads and prevention of accidents are the prime concern of any highway system. Data mining is a source of retrieval of information for knowledge discovery approach. Many data mining methodologies have been applied to accident data in the recent past years. There is need to analyze the relationship between different factors related to accidents i.e. number of persons affected by fatal, minor, grievous, non-injury, road feature (ROF), road condition (ROC), cause of accident (CAU) and vehicle responsible (VR) according to daily, fortnightly, semi-fortnightly and monthly basis. The objective of this study is divided into three sub-objectives. The First sub-objective of this study is to divide number of accident dataset of National Highway sections of Karnataka state implemented by Project Implementation Unit i.e. PIU (Bangalore, Chitradurga, Dharwad, Gulbarga, Hospet and Mangalore) during January 2012 to January 2017 collected from NHAI (National Highway Authority of India) in homogeneous clusters using K-means clustering. The second sub-objective is to reflect the relationship between different factors i.e. a number of persons affected by fatal, minor, grievous, non-injury, CAU, ROC, ROF and VR using Apriori association rule. The last sub-objective is to perform temporal trend analysis for each cluster on the basis of rules generated by Association Rule Mining.
Indian Journal of Data Mining, Volume 1, pp 15-20; https://doi.org/10.54105/ijdm.c1619.051322
Consumer sentiment analysis has gained immense attention in the recent past. The abundance of data in today’s world, especially those generated from the social media platforms, has triggered sentiment exploration like never before. The analysis of consumer sentiments have indeed helped organizations in effective decision making worldwide. In the communication technology domain, voice activated virtual assistants (VAVAs) are one of the latest entrants and they are gaining immense popularity by the time. Brand sentiment studies on VAVAs being limited in number creates an opportunity to explore further. This study fits into the domain of sentiment mining and the purpose of the paper is to review the consumer sentiment towards the global leader brand in the voice activated virtual assistant product segment, Amazon Alexa. Of the various approaches available, the researchers chose unsupervised learning based lexicon approach to estimate the brand sentiment. Three popular lexicon based sentiment classifiers, TextBlob, VADER and AFINN, have been used in the present context for exploration purpose. To the best of the knowledge of the researchers, this research effort includes, for the first time, multiple lexicon based approaches in exploring the sentiment towards the brand Alexa. This study shows consumers to have a significantly positive sentiment towards the chosen brand. The output from the three comparative classifiers reveal similar results which also validates the robustness of the outcomes and that of the chosen methods. The study anticipates a bright sales potential of the brand. Also, the use of alternative lexicon approaches is expected to enrich the existing literature in the sentiment mining domain.
Published: 10 November 2021
Indian Journal of Data Mining, Volume 1, pp 20-26; https://doi.org/10.54105/ijdm.b1615.111221
Data mining is a rapidly developing technology that has enriched a lot of field such as business analysis, market analysis, weather forecasting, stock market analysis and many more. It starts with collecting data sets from reliable sources and pre-processing that data. There are some anomalies associated with data collected in large volumes such as outliers, missing values, and duplicated values. Remove these kinds of anomalies is teamed as pre-processing of data. In this paper, collection of weather data and pre-processing it for rainfall prediction model using Rapid Miner tool has been discussed. Also, artificial neural network data mining techniques is used to design a rainfall prediction model. ANN classification techniques is a complex data mining technique results in high accuracy in prediction of rainfall.
Published: 10 November 2021
Indian Journal of Data Mining, Volume 1, pp 14-19; https://doi.org/10.54105/ijdm.b1614.111221
Nowadays, the use of social media has increased exponentially. People show different behavior on social media depending on the kind of responses and behavior of people around them. It is important now to analyze the behavior of social media users and the way how they affect their friends. In this paper, behavioral analysis of people is done based on Twitter data. An algorithm is proposed which helps in finding the impact of text written by someone on social media and its effect on others. The impact of written text is calculated with the help of the number of retweets done for the same tweet. The severity of the used word is calculated based on AFINN dictionary. According to the proposed algorithm, the score of the dictionary is recalculated when a negative word is forwarded multiple times. This is done with the understanding that if a less severe negative word is used many times, it may affect the person in a highly negative manner. With this, Severity of words is recalculated and its impact on people is found with the help of the proposed algorithm. The impact of using negative words on social media affect 32 % of the total users (in their friend-list). Behavior change is demonstrated with the help of graphs week-wise, month-wise and year-wise analyses. The research helps in finding the impact of swear words on social media users depending on the frequency and severity score of the words.
Published: 10 November 2021
Indian Journal of Data Mining, Volume 1, pp 1-13; https://doi.org/10.54105/ijdm.a1608.111221
The Mineral and Mining industry is a huge energy requiring sector which demands connections to a viable electric power origin and reference. With the upsurge of these mineral requests and decreasing valued grades of ores, energy aspiration is approximated to sky-rocket to 36% by the year 2035. It is even projected to accelerate geometrically from the fact that sophistication and powering of the mine locations speeds up the stability into the necessity of energy applications and its requirements derived from fossil fuels utilized to generate electricity. This paper discusses other fossilized fuel-based materials for generation of electricity.
Indian Journal of Data Mining, Volume 1, pp 1-6; https://doi.org/10.54105/ijdm.a1603.051121
Machine learning which is a sub-domain of an Artificial Intelligence which is finding various applications in manufacturing and material science sectors. In the present study, Deep Generative Modeling which a type of unsupervised machine learning technique has been adapted for the constructing the artificial microstructure of Aluminium-Silicon alloy. Deep Generative Adversarial Networks has been used for developing the artificial microstructure of the given microstructure image dataset. The results obtained showed that the developed models had learnt to replicate the lining near the certain images of the microstructures.