Search for:
Email Address

business@nitinetes.com

Hotline Support

+62 821 8888 7045

Location

Gedung Artha Graha 26F (OfficeNow), SCBD, Sudirman Street, DKI Jakarta

Our Services

Social Media Management

In the dynamic digital landscape, where conversations unfold at lightning speed and trends emerge with every scroll, social media marketing stands as the cornerstone of our strategy at Nitinetes Digital Solution.

Read More

Website Design & Development

A well-designed and developed website is a valuable asset for any business or organization in today's digital age. It not only serves as an online presence but also plays a significant role in marketing, customer engagement, and brand building

Read More

Community, Buzzer, Influencer

In the ever-evolving world of digital marketing, the trifecta of Community Engagement, Buzzer Marketing, and Influencer Collaboration stands as a dynamic strategy that can propel your brand to new heights of online success.

Read More
image
image
image
image
image
image

Our Features

Influencer Marketing

Collaborate with industry influencers to expand your reach and enhance your brand's credibility

Data-Driven Insights

We leverage advanced analytics tools to provide you with actionable insights, ensuring your decisions are backed by real data.

Content Marketing

Our skilled content creators develop compelling, relevant, and engaging content that resonates with your audience.

Digital Strategy

Our experts design customized digital marketing strategies tailored to your unique business goals and target audience.

Social Media Management

Build a strong social media presence and engage with your followers across various platforms.

Web Development

Create a user-friendly and visually appealing website that converts visitors into customers.

image

Journal Review – A scalable and flexible basket analysis system for big transaction data in Spark

Welcome to our blog! 


Here, we proudly present the insights and key takeaways from our review of the journal “A Scalable and Flexible Basket Analysis System for Big Transaction Data in Spark.” This blog is part of our academic journey as postgraduate students in the Informatics Engineering program at Universitas Pamulang.

Under the guidance of our lecturer, Dr. Arya Adhyaksa Waskita, S.Si., M.Si., and as members of Group 4 (01MKME001 V344), we aim to explore and share the latest advancements in network and computer systems. Our team, consisting of Devi Nourma Mita Bela Oktalanti, Ibnu Syifa, M. Fajri Nurkholis, and Bagaswara Robbiantarto, has worked collaboratively to delve into the challenges of big data analytics and the innovative solutions offered in the selected journal.

This review not only highlights the significance of scalable and flexible systems for handling vast transactional data but also bridges the gap between theoretical knowledge and practical applications in the field of big data. We hope this blog provides valuable insights and sparks interest among readers who are keen to explore data analytics and its transformative impact across industries.

Thank you for visiting our blog, and we look forward to engaging discussions and knowledge sharing. Enjoy reading! 

Basket analysis is an essential analytical task widely used in retail, finance, logistics, e-commerce, and bioinformatics to uncover associations between items frequently purchased together. Traditional basket analysis techniques, such as Apriori, Eclat, and FP-Growth algorithms, have been foundational in discovering frequent itemsets. However, these approaches face significant challenges in the era of big data due to issues of scalability and adaptability to diverse business tasks. This paper addresses these issues by proposing a new distributed frequent itemset mining (FIM) algorithm, ScaDistFIM, and a basket analysis system implemented in Spark.

The study aims to overcome two major limitations:

  1. Scalability: Traditional FIM algorithms fail to handle terabyte-scale datasets effectively due to memory and computational constraints.

  2. Flexibility: Existing systems lack the adaptability to address varied and evolving business requirements in basket analysis tasks.

The proposed system operates in two stages:

  1. Frequent Itemset Mining (FIM):

    • The ScaDistFIM algorithm employs a divide-and-conquer approach, partitioning transaction data into random subsets.

    • Local frequent itemsets are mined in parallel using a sequential FP-Growth algorithm.

    • An approximate set of frequent itemsets is derived using a voting mechanism to combine local results.

  2. Integration and Querying:

    • A flexible data model integrates the mined frequent itemsets with external business-relevant attributes (e.g., price, location, category).

    • Extended BI (Base Itemset) tables support SQL queries to enable diverse basket analysis tasks.

The ScaDistFIM algorithm is implemented in Spark, leveraging its distributed computing capabilities for efficient data processing.

The system’s performance was evaluated on both real-world and synthetic datasets, demonstrating the following:

  1. Efficiency:

    • The ScaDistFIM algorithm showed a 90% reduction in execution time compared to the Spark FP-Growth algorithm.

    • On a dataset with 1 billion records, ScaDistFIM completed the analysis in 360 seconds, whereas Spark FP-Growth failed due to memory limitations.

    • preencoded.png

  2. Accuracy:

    • The algorithm achieved near-perfect precision and recall, with minimal average support error (approximately 1%).

    • preencoded.png

  3. Scalability:

    • The system processed datasets up to 1 terabyte effectively, demonstrating scalability to billions of transactions.

  4. Flexibility:

    • The integration of external attributes enabled diverse analysis tasks, such as identifying high-value baskets, localized promotions, and inventory optimization.

The ScaDistFIM algorithm’s use of an approximate FIM method addresses scalability challenges by minimizing inter-node communication and reducing memory requirements. The two-stage process separates data mining from analytical querying, allowing users to flexibly adapt the system for different business needs without reprocessing the entire dataset.

However, the reliance on approximate frequent itemsets may introduce false positives or negatives, though the error rates are reported to be negligible in practical scenarios. The system’s implementation in Spark ensures broad applicability and ease of integration with existing data processing pipelines.

This study contributes significantly to the field of big data analytics by proposing a scalable and flexible basket analysis system. The ScaDistFIM algorithm addresses critical bottlenecks in memory and computation, while the integration of extended BI tables provides versatility in supporting diverse business queries. Future work could explore enhancing the precision of approximate frequent itemsets and extending the system’s applicability to other domains beyond retail.

The proposed system offers retailers and other industries a powerful tool for harnessing large transaction datasets to uncover actionable insights. By integrating external business attributes, it bridges the gap between data mining and decision-making, paving the way for smarter, data-driven strategies.

PESO Model

Understanding the PESO Model: A Framework for Effective PR and Marketing

“10 Digital Marketing Trends to Watch in 2023”

The digital marketing landscape is continually evolving, with new technologies and strategies shaping the way brands connect with their audience. As we step into 2023, it’s essential to stay ahead of the curve by keeping an eye on the latest trends. Here are ten digital marketing trends that are set to make waves this year:

image