Error using TM VCorpus package in R

Question

I ran into the error below when using the tm package in R.

library("tm")
Loading required package: NLP
Warning messages:
1: package ‘tm’ was built under R version 3.4.2 
2: package ‘NLP’ was built under R version 3.4.1

corpus <- VCorpus(DataframeSource(data))

Error: all(!is.na(match(c("doc_id", "text"), names(x)))) is not TRUE

I tried several things, such as reinstalling the package and updating to a newer version of R, but the error persists. The same code runs without error on the same data file on a different system with the same version of R.

Tags: r, text-mining, tm, text-analysis

Saharsh gandhi Nov 21 '17 at 6:27

1 answer

Eva · Answer 1 · 2017-11-29T08:02:14+0000

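I ran into the same problem after upgrading the tm package to version 0.7-2. I looked up the details of DataframeSource(), and the documentation says:

A data frame source interprets each row of the data frame x as a document. The first column must be named "doc_id" and contain a unique string identifier for each document. The second column must be named "text" and contain a "UTF-8" encoded string representing the document's content. Optional additional columns are used as document level metadata.

I solved it with the following code: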

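# Read the source data, keeping strings as character vectors
df_cmp <- read.csv("test_file.csv", stringsAsFactors = FALSE)

# Build the two columns DataframeSource() requires: doc_id and text
df_title <- data.frame(doc_id = row.names(df_cmp),
                       text = df_cmp$English.title,
                       stringsAsFactors = FALSE)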

The data frame then has the doc_id and text columns that DataframeSource() expects.
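If you need this in more than one place, the renaming step above can be wrapped in a small helper. This is only a rough sketch: to_tm_frame() is a hypothetical name, not part of tm, and the sample data frame is invented for illustration. It reuses the same check that appears in the error message, and only calls VCorpus() if tm is installed.

```r
# Hypothetical helper (not part of tm): build a data frame whose first two
# columns are "doc_id" and "text", the layout DataframeSource() asks for.
to_tm_frame <- function(df, text_col, id_col = NULL) {
  stopifnot(is.data.frame(df), text_col %in% names(df))

  # Use an existing id column if given, otherwise fall back to row numbers
  ids <- if (is.null(id_col)) {
    as.character(seq_len(nrow(df)))
  } else {
    as.character(df[[id_col]])
  }
  stopifnot(anyDuplicated(ids) == 0)  # document ids must be unique

  data.frame(doc_id = ids,
             text = as.character(df[[text_col]]),
             stringsAsFactors = FALSE)
}

# Invented sample data standing in for test_file.csv
df_cmp <- data.frame(English.title = c("First title", "Second title"),
                     year = c(2016, 2017),
                     stringsAsFactors = FALSE)

df_title <- to_tm_frame(df_cmp, text_col = "English.title")

# Same condition that the error message in the question reports as failing
ok <- all(!is.na(match(c("doc_id", "text"), names(df_title))))

# Build the corpus only when the check passes and tm is available
if (ok && requireNamespace("tm", quietly = TRUE)) {
  corpus <- tm::VCorpus(tm::DataframeSource(df_title))
}
```

Running names(data) on the original data frame from the question is a quick way to confirm whether the two required column names are missing.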
