Translations:Wikipedia Quality/13/en

From Wikipedia Quality
Jump to: navigation, search

Data Mining

To build such models, you can use various algorithms, in particular Data Mining. One of the most commonly used algorithms – Random Forest[1][2][3][4][5][6][7][8]. There are even studies[4], which compare it with other algorithms (CART, SMO, Multilayer Perceptron, LMT, C4.5, C5.0 and others). Random Forest allows to build models even using variables that correlates with each other. Additionally, this algorithm can show which variables are more important for determining the quality of articles. If we need to get other information about the importance of variables, we can use other algorithms, including logistic regression.[9]
  1. Cite error: Invalid <ref> tag; no text was provided for refs named art1
  2. Cite error: Invalid <ref> tag; no text was provided for refs named art2
  3. Cite error: Invalid <ref> tag; no text was provided for refs named art3
  4. 4.0 4.1 Cite error: Invalid <ref> tag; no text was provided for refs named art4
  5. Cite error: Invalid <ref> tag; no text was provided for refs named art5
  6. Cite error: Invalid <ref> tag; no text was provided for refs named art6
  7. Cite error: Invalid <ref> tag; no text was provided for refs named art7
  8. Cite error: Invalid <ref> tag; no text was provided for refs named art20
  9. Cite error: Invalid <ref> tag; no text was provided for refs named art13