Package 'pullword'

Title: R Interface to Pullword Service
Description: R Interface to Pullword Service for natural language processing in Chinese. It enables users to extract valuable words from text by deep learning models. For more details please visit the official site (in Chinese) <http://www.pullword.com/>.
Authors: Tong He <[email protected]>
Maintainer: Tong He <[email protected]>
License: LGPL
Version: 0.3
Built: 2024-11-07 03:25:41 UTC
Source: https://github.com/hetong007/pullword

Help Index


Indicate whether the encoding of input string is BIG5.

Description

Indicate whether the encoding of input string is BIG5.

Usage

isBIG5(string, combine = FALSE)

Arguments

string

A character vector.

combine

Whether to combine all the strings.

Value

Logical value.

Author(s)

Jian Li <[email protected]>


Indicate whether the encoding of input string is GB18030.

Description

Indicate whether the encoding of input string is GB18030.

Usage

isGB18030(string, combine = FALSE)

Arguments

string

A character vector.

combine

Whether to combine all the strings.

Value

Logical value.

Author(s)

Jian Li <[email protected]>


Indicate whether the encoding of input string is GB2312.

Description

Indicate whether the encoding of input string is GB2312.

Usage

isGB2312(string, combine = FALSE)

Arguments

string

A character vector.

combine

Whether to combine all the strings.

Value

Logical value.

Author(s)

Jian Li <[email protected]>


Indicate whether the encoding of input string is GBK.

Description

Indicate whether the encoding of input string is GBK.

Usage

isGBK(string, combine = FALSE)

Arguments

string

A character vector.

combine

Whether to combine all the strings.

Value

Logical value.

Author(s)

Jian Li <[email protected]>


Indicate whether the encoding of input string is UTF-8.

Description

Indicate whether the encoding of input string is UTF-8.

Usage

isUTF8(string, combine = FALSE)

Arguments

string

A character vector.

combine

Whether to combine all the strings.

Value

Logical value.

Author(s)

Jian Li <[email protected]>


R Interface of PullWord service

Description

This function deals with communication with the server. The result will be parsed in a vector or a matrix, depends on show probability or not

pullword

Usage

pullword(input = NULL, file = NULL, threshold = 0, showProb = FALSE)

Arguments

input

The input text.

file

The input file.

threshold

The minimum probability for the words appearing in the result. Should be a real value between 0 and 1.

showProb

logical. The return value would be a data.frame if TRUE, or a vector otherwise.

Examples

require(pullword)
pullword('Replace this field with a Chinese sentence.',threshold=0,showProb=TRUE)

Convert encoding of Chinese string to UTF-8.

Description

Convert encoding of Chinese string to UTF-8.

Usage

toUTF8(cnstring)

Arguments

cnstring

A Chinese string vector.

Value

Converted vectors.

Author(s)

Jian Li <[email protected]>