Abstract:
With the growth of e-commerce, the size of available data has grown to an incalculable level. It has been an excellent opportunity for companies to leverage those data to derive intelligent information. However, limiting factor is the inability of traditional learning algorithms to process such a large dataset within a reasonable time. Moreover, the data in the e-commerce domain contains many unstructured and unreliable data sources. As a result, the databases arefilled with noisy data. On the other hand, the traditional methods do not perform well on noisy data. In order to overcome this problem,distributed machine-learning techniques arebecoming ever more popular within the research communities. In this project, multi-agent based distributed computing environment has used for segmenting consumers using Artificial Neural Network (ANN) on e-commerce dataset. Hopfield NeuralNetwork modelhas used to cluster the customer base in a perspective of marketing segmentation. Data clustering mechanism isimplemented with multi-agent technologies on distributed environment. The data partitioning techniques such as modular base approaches have usedto process the ANN on distributed computing nodes. Further, multiple outputs are generated by different processing nodes have aggregated by querying the nearest cluster centroid for the given node. The application tasks such as data partitioning, consumer clustering, result combining and, etc. have implemented as agents. Further, the clustering agents are implemented to utilize the capability of heterogeneous computing environment, which has GPU and CPU. Due to the platform independent nature on multi-agent systems, the application can be deployed on a workstation that has various hardware and software configurations while utilizing either GPU or CPU for data computation.