Difference between revisions of "Wikibench: a Distributed, Wikipedia based Web Application Benchmark"

From Wikipedia Quality
Jump to: navigation, search
(Starting an article - Wikibench: a Distributed, Wikipedia based Web Application Benchmark)
 
(+ links)
Line 1: Line 1:
'''Wikibench: a Distributed, Wikipedia based Web Application Benchmark''' - scientific work related to Wikipedia quality published in 2009, written by Erik-Jan van Baaren.
+
'''Wikibench: a Distributed, Wikipedia based Web Application Benchmark''' - scientific work related to [[Wikipedia quality]] published in 2009, written by [[Erik-Jan van Baaren]].
  
 
== Overview ==
 
== Overview ==
Many different, novel approaches have been taken to improve throughput and scalability of distributed web application hosting systems and relational databases. Yet there are only a limited number of web application benchmarks available. Authors present the design and implementation of WikiBench, a distributed web application benchmarking tool based on Wikipedia. WikiBench is a trace based benchmark, able to create realistic workloads with thousands of requests per second to any system hosting the freely available Wikipedia data and software. Authors obtained completely anonymized, sampled access traces from the Wikimedia Foundation, and authors created software to process these traces in order to reduce the intensity of its traffic while still maintaining the most important properties such as inter-arrival times and distribution of page popularity. This makes WikiBench usable for both small and large scale benchmarks. Initial benchmarks show a regular day of traffic with its ups and downs. By using median response times, authors are able to show the effects of increasing traffic intensities on system under test.
+
Many different, novel approaches have been taken to improve throughput and scalability of distributed web application hosting systems and relational databases. Yet there are only a limited number of web application benchmarks available. Authors present the design and implementation of WikiBench, a distributed web application benchmarking tool based on [[Wikipedia]]. WikiBench is a trace based benchmark, able to create realistic workloads with thousands of requests per second to any system hosting the freely available Wikipedia data and software. Authors obtained completely anonymized, sampled access traces from the [[Wikimedia Foundation]], and authors created software to process these traces in order to reduce the intensity of its traffic while still maintaining the most important properties such as inter-arrival times and distribution of page popularity. This makes WikiBench usable for both small and large scale benchmarks. Initial benchmarks show a regular day of traffic with its ups and downs. By using median response times, authors are able to show the effects of increasing traffic intensities on system under test.

Revision as of 13:29, 21 November 2020

Wikibench: a Distributed, Wikipedia based Web Application Benchmark - scientific work related to Wikipedia quality published in 2009, written by Erik-Jan van Baaren.

Overview

Many different, novel approaches have been taken to improve throughput and scalability of distributed web application hosting systems and relational databases. Yet there are only a limited number of web application benchmarks available. Authors present the design and implementation of WikiBench, a distributed web application benchmarking tool based on Wikipedia. WikiBench is a trace based benchmark, able to create realistic workloads with thousands of requests per second to any system hosting the freely available Wikipedia data and software. Authors obtained completely anonymized, sampled access traces from the Wikimedia Foundation, and authors created software to process these traces in order to reduce the intensity of its traffic while still maintaining the most important properties such as inter-arrival times and distribution of page popularity. This makes WikiBench usable for both small and large scale benchmarks. Initial benchmarks show a regular day of traffic with its ups and downs. By using median response times, authors are able to show the effects of increasing traffic intensities on system under test.