If you have large corpora in some "unusual" languages (in the sense of "those for which only a limited amount of computational linguistics has been done"), repeating existing computational linguistics work already done for very popular languages (such as English, Chinese, Arabic, ...) is a perfectly suitable project. That holds especially in an academic setting, but it can be quite suitable for industry too: back when I was in computational linguistics with IBM Research, I got interesting mileage out of putting together a corpus for Italian and repeating [[at the relatively new IBM Scientific Center in Rome]] work very similar to what the IBM Research team at Yorktown Heights [[of which I had been part]] had already done for English.