tm - Text Mining Package
A framework for text mining applications within R.
Last updated
cpp
12.99 score 91 dependents 20k scripts 57k downloadstseries - Time Series Analysis and Computational Finance
Time series analysis and computational finance.
Last updated
openblas
10.94 score 4 stars 108 dependents 13k scripts 171k downloadsclue - Cluster Ensembles
CLUster Ensembles.
Last updated
10.47 score 2 stars 445 dependents 704 scripts 79k downloadsmlbench - Machine Learning Benchmark Problems
A collection of artificial and real-world machine learning benchmark problems, including, e.g., several data sets from the UCI repository.
Last updated
9.37 score 2 stars 81 dependents 5.6k scripts 54k downloadsNLP - Natural Language Processing Infrastructure
Basic classes and methods for Natural Language Processing.
Last updated
9.00 score 6 stars 123 dependents 1.2k scripts 36k downloadsslam - Sparse Lightweight Arrays and Matrices
Data structures and algorithms for sparse arrays and matrices, based on index arrays and simple triplet representations, respectively.
Last updated
openblas
8.85 score 2 stars 354 dependents 1.3k scripts 62k downloadsRWeka - R/Weka Interface
An R interface to Weka (Version 3.9.3). Weka is a collection of machine learning algorithms for data mining tasks written in Java, containing tools for data pre-processing, classification, regression, clustering, association rules, and visualization. Package 'RWeka' contains the interface code, the Weka jar is in a separate package 'RWekajars'. For more information on Weka see <https://www.cs.waikato.ac.nz/ml/weka/>.
Last updated
openjdk
8.79 score 4 stars 14 dependents 1.9k scripts 9.7k downloadschron - Chronological Objects which Can Handle Dates and Times
Provides chronological objects which can handle dates and times.
Last updated
8.68 score 112 dependents 2.8k scripts 52k downloadsISOcodes - Selected ISO Codes
ISO language, territory, currency, script and character codes. Provides ISO 639 language codes, ISO 3166 territory codes, ISO 4217 currency codes, ISO 15924 script codes, and the ISO 8859 character codes as well as the UN M.49 area codes.
Last updated
6.11 score 91 dependents 204 scripts 23k downloadsskmeans - Spherical k-Means Clustering
Algorithms to compute spherical k-means partitions. Features several methods, including a genetic and a fixed-point algorithm and an interface to the CLUTO vcluster program.
Last updated
5.71 score 2 stars 22 dependents 124 scripts 5.3k downloadsopenNLP - Apache OpenNLP Tools Interface
An interface to the Apache OpenNLP tools (version 1.5.3). The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text written in Java. It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution. See <https://opennlp.apache.org/> for more information.
Last updated
openjdk
5.56 score 4 stars 11 dependents 448 scripts 1.5k downloadsmovMF - Mixtures of von Mises-Fisher Distributions
Fit and simulate mixtures of von Mises-Fisher distributions.
Last updated
5.50 score 1 stars 18 dependents 69 scripts 1.4k downloadsrelations - Data Structures and Algorithms for Relations
Data structures and algorithms for k-ary relations with arbitrary domains, featuring relational algebra, predicate functions, and fitters for consensus relations.
Last updated
5.49 score 14 dependents 61 scripts 6.0k downloadsRsymphony - SYMPHONY in R
An R interface to the SYMPHONY solver for mixed-integer linear programs.
Last updated
coinor-symphony
5.07 score 6 dependents 93 scripts 7.1k downloadsdate - Functions for Handling Dates
Functions for handling dates.
Last updated
4.93 score 8 dependents 720 scripts 2.5k downloadstau - Text Analysis Utilities
Utilities for text analysis.
Last updated
4.38 score 5 dependents 141 scripts 2.8k downloadsRWekajars - R/Weka Interface Jars
External jars required for package 'RWeka'.
Last updated
openjdk
4.08 score 15 dependents 45 scripts 5.9k downloadsUnicode - Unicode Data and Utilities
Data from Unicode 17.0.0 and related utilities.
Last updated
4.02 score 4 dependents 145 scripts 814 downloadsbindata - Generation of Artificial Binary Data
Generation of correlated artificial binary data.
Last updated
3.63 score 3 dependents 239 scripts 646 downloadsopenNLPdata - Apache OpenNLP Jars and Basic English Language Models
Apache OpenNLP jars and basic English language models.
Last updated
openjdk
3.38 score 12 dependents 40 scripts 1.7k downloadstextcat - N-Gram Based Text Categorization
Text categorization based on n-grams.
Last updated
3.12 score 3 stars 212 scripts 1.0k downloadscclust - Convex Clustering Methods and Clustering Indexes
Convex Clustering methods, including K-means algorithm, On-line Update algorithm (Hard Competitive Learning) and Neural Gas algorithm (Soft Competitive Learning), and calculation of several indexes for finding the number of clusters in a data set.
Last updated
2.86 score 1 dependents 48 scripts 685 downloadsoz - Plot the Australian Coastline and States
Functions for plotting Australia's coastline and state boundaries.
Last updated
2.77 score 100 scripts 5.9k downloadsOAIHarvester - Harvest Metadata Using OAI-PMH Version 2.0
Harvest metadata using the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) version 2.0 (for more information, see <https://www.openarchives.org/OAI/openarchivesprotocol.html>).
Last updated
2.56 score 8 scripts 3.6k downloadsRKEA - R/KEA Interface
An R interface to KEA (Version 5.0). KEA (for Keyphrase Extraction Algorithm) allows for extracting keyphrases from text documents. It can be either used for free indexing or for indexing with a controlled vocabulary. For more information see <http://www.nzdl.org/Kea/>.
Last updated
openjdk
2.00 score 1 stars 7 scripts 217 downloadsRpoppler - PDF Tools Based on Poppler
PDF tools based on the Poppler PDF rendering library. See <http://poppler.freedesktop.org/> for more information on Poppler.
Last updated
popplerglib
1.68 score 8 scripts 4.7k downloadsW3CMarkupValidator - R Interface to W3C Markup Validation Services
R interface to a W3C Markup Validation service. See <https://validator.w3.org/> for more information.
Last updated
1.48 score 3 scripts 682 downloadsRKEAjars - R/KEA Interface Jars
External jars required for package RKEA.
Last updated
openjdk
1.48 score 1 dependents 203 downloadstm.plugin.mail - Text Mining E-Mail Plug-in
A plug-in for the tm text mining framework providing mail handling functionality.
Last updated
1.43 score 27 scripts 253 downloadsNLPutils - Natural Language Processing Utilities
Utilities for Natural Language Processing.
Last updated
openjdk
1.00 score 4 scripts 134 downloads