Reading Chinese Menus: Concepts: grep | Adventures with Kake

kake

Just a quick post today, to mention one of the most useful computer tools I've found so far for helping me access and organise my vocab lists and transcribed menus — grep.

grep is a commandline tool that should be available on all Unixes (Linux, Solaris, OS X, etc.), and on all those I have access to, it deals just fine with Chinese characters. This means that I can easily check through all my textfile documents to find, for example, dishes with prawns in: grep 蝦 *.txt

This is pretty powerful on its own, really, but the one thing it can't do is take account of simplified vs. traditional characters — and some of my lists/menus are copy-pasted from sources that use simplified characters, while the ones I've written/transcribed myself are in traditional characters.

So I wrote some Perl to make this easier, and you can find it on CPAN. It includes a commandline utility called dets (desensitise traditional-simplified) which builds a regexp from a string and can be used like so: grep `dets 蝦` *.txt (dets 蝦 returns [虾蝦]).
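To make the expansion concrete, here is a minimal sketch of what the shell does with that command line. It does not call the real dets: dets_sketch is a hypothetical stand-in that only knows the single 蝦/虾 pair, and find_prawns is likewise a made-up name for illustration.

```shell
#!/bin/sh
# Hypothetical stand-in for dets: it knows one traditional/simplified pair only.
# How the real dets builds its regexp is not shown here.
dets_sketch() {
    case "$1" in
        蝦|虾) printf '%s\n' '[虾蝦]' ;;
        *)     printf '%s\n' "$1" ;;   # unknown input: pass through unchanged
    esac
}

# Search the given files for either form of the character.
# The command substitution runs first, so grep receives the pattern [虾蝦],
# a bracket expression matching either 虾 or 蝦.
find_prawns() {
    grep -e "$(dets_sketch 蝦)" "$@"
}
```

Calling find_prawns *.txt should then list lines containing either form. The substitution is quoted in this sketch so that the shell hands [虾蝦] to grep as-is. Left unquoted, the result would be subject to filename expansion, which only matters if the current directory happens to contain a file named 虾 or 蝦.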

I realise I don't usually write about geek stuff on here, so eyes may be glazing over at this point — but if the owners of the remaining eyes have any comments, patches, or bug reports, I would love to hear them.

If you have any questions or corrections, please leave a comment (here's how) and let me know (or email me at kake@earth.li). See my introductory post to the Chinese menu project for what these posts are all about.


From:

I think $() is better style than ``, but YMMV :)

From:

kake

That works too! Though I thought they were equivalent except that the former can be nested and the latter can't?

From:

The quoting for $() is a bit saner, too. I think you can nest the latter by `foo \` bar \` `, but ...
