ValueError: Failed to convert string to float:

Question

ValueError: Failed to convert string to float:

I follow this guide to write a Naive Bayes classifier: http://machinelearningmastery.com/naive-bayes-classifier-scratch-python/

I keep getting this error:

dataset[i] = [float(x) for x in dataset[i]]
ValueError: could not convert string to float:

Here is the part of my code where the error occurs:

def loadDatasetNB(filename):
    lines = csv.reader(open(filename, "rt"))
    dataset = list(lines)
    for i in range(len(dataset)):
        dataset[i] = [float(x) for x in dataset[i]]
    return dataset

And here is how the file is called:

def NB_Analysis():
    filename = 'fvectors.csv'
    splitRatio = 0.67
    dataset = loadDatasetNB(filename)
    trainingSet, testSet = splitDatasetNB(dataset, splitRatio)
    print('Split {0} rows into train={1} and test={2} rows').format(len(dataset), len(trainingSet), len(testSet))
    # prepare model
    summaries = summarizeByClassNB(trainingSet)
    # test model
    predictions = getPredictionsNB(summaries, testSet)
    accuracy = getAccuracyNB(testSet, predictionsNB)
    print('Accuracy: {0}%').format(accuracy)

NB_Analysis()

My fvectors.csv file looks like this

What is wrong here, and how can I fix it?

+4

python scikit-learn csv machine-learning

Thom elliott Mar 26 '17 at 15:00

source share

4 answers

Taras Matsyk · Answer 1 · 2017-03-26T15:03:58+0000

Try skipping the heading; an empty heading in the first column causes the problem.

>>> float(' ')
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
ValueError: could not convert string to float:

If you want to skip the header, you can achieve it:

def loadDatasetNB(filename):
    lines = csv.reader(open(filename, "rt"))
    next(reader, None)  # <<- skip the headers
    dataset = list(lines)
    for i in range(len(dataset)):
        dataset[i] = [float(x) for x in dataset[i]]
    return dataset

(2) Or you can just ignore the exception:

try:
    float(element)
except ValueError:
    pass

(2), , , , .

James · Answer 2 · 2017-03-26T15:09:41+0000

, python square circle. , , .

:

def loadDatasetNB(filename):
    with open(filename, 'r') as fp:
        reader= csv.reader(fp)
        # skip the header line
        header = next(reader)
        # save the features and the labels as different lists
        data_features = []
        data_labels = []
        for row in reader:
            # convert everything except the label to a float
            data_features.append([float(x) for x in row[:-1]])
            # save the labels separately
            data_labels.append(row[-1])
    return data_features, data_labels

Yuval Pruss · Answer 3 · 2017-03-26T15:05:22+0000

.

>> float('')
ValueError: could not convert string to float:

:

dataset[i] = [float(x) for x in dataset[i] if x != '']

Julien · Answer 4 · 2017-03-26T15:05:52+0000

float, , , :

dataset[i] = [float(x) for x in dataset[i]]

, , for, :

data = []
for x in dataset[i]:
    try:
        value = float(x)
    except ValueError:
        value = x
    data.append(value)
dataset[i] = data

. :

/ Python: ?

ValueError: Failed to convert string to float:

More articles: