Awk edit only 1 column with regex

Question

I have a CSV file with three columns:

id,text,date
123,hi 你好吗？,2016-01-01
246,this is stackoverflow 我需要帮忙,2016-02-01

I only want to edit column 2, where I delete only English characters and keep Chinese ones. The remaining columns remain intact.

The output I want:

id,text,date
123,你好吗？,2016-01-01
246,我需要帮忙,2016-02-01

Is there a better way to do this than this:

cat myfile.csv|cut -d, -f2|sed 's/[a-zA-Z]*//g' > tmp.csv
paste -d, myfile.csv tmp.csv|awk -F, '{OFS=",";print $1,$7,$3}' >tmp2.csv

+4

jxn Jan 28 '16 at 23:29

4 answers

Fabricator · Answer 1 · 2016-01-28T23:51:53+0000

awk -F, 'BEGIN {OFS=","} { if (NR>1) {gsub(/[\x00-\x7F]/, "", $2)}; print }' test.txt

Ed morton · Answer 2 · 2016-01-29T01:07:04+0000

If the script you posted at the bottom of your question works for you, it will:

awk 'BEGIN{FS=OFS=","} NR>1{gsub(/[a-zA-Z]/,"",$2)} 1' file

You said “characters,” though not “letters,” but YMMV.

bian · Answer 3 · 2016-01-29T00:43:41+0000

awk -F, '{ s=split($2,t," "); sub($2, t[s]); print }' file
id,text,date
123,你好吗？,2016-01-01
246,我需要帮忙,2016-02-01

Hackaholic · Answer 4 · 2016-01-29T06:46:49+0000

awk 'NR==1{print;}NR>1{gsub(/[a-zA-Z ]+/,"");print;}' your_file
id,text,date
123,你好吗？,2016-01-01
246,我需要帮忙,2016-02-01