Tag, You're It

One of the most frequent complaints I get about my photo album is that it's too linear - i.e., with over two thousand photos in there, some people would rather browse by topic/keyword than follow a time line or click around randomly.

But to do so requires proper handling of metadata, and we all know how is at storing and manipulating the metadata in the files themselves - not only does it not handle all / fields I need, but it insists on storing the metadata separately and is inconsistent as hell on what it imports and exports.

This is something I've been complaining about for years (just check out my node or search for "metadata"), and I've been coding my own web-based tagger .

The trouble is that there have been two things putting me off time and again:

  1. the lack of a decent library to write data
  2. my utter lack of time to code (which includes being able to do a decent UI with )

My current Snakelets prototype works for reading, adding comments and using a sort of "poor man's tagging". This consists of adding keywords to the Comments field (like my ), and works fine as a general technique.

The real trouble is that the time and energy required to polish the application to a level where I could actually use it on a daily basis was too much. Maybe going all Web 2.0 on it wasn't a good idea, since making sure thumbnails were cached and generated properly, doing the "automagic" completion UI, getting drag and drop to work, etc., etc. turned out to be a major drag (pun intended).

I'm finicky to the point of spending hours getting the interaction model down to something I can use without thinking and re-painting pixels to line up just right, and dealing with JavaScript and several browser quirks was getting on my nerves - which meant less and less time (or will power) to make any significant progress.

Scripting Judo

So I decided to turn the problem around - instead of rolling my own, I am now going back to where I was , trying to make the best out of what (and quite a few other folk out there) already built. Which, since it means using as the main UI, involves lots and lots of .

The only catch is that it all has to comply with my approach - i.e., the end result must allow me to export a set of photos with proper / tagging, even if the tools themselves are fairly proprietary.

As a starting point, I activated the Script Menu on my and grabbed Andrew Turner's set of s. He uses ExifTool (yes, yes, I know, it's a Perl-based utility - pfig will harp on about this for weeks), and the download archive includes three basic scripts:

  • One to set copyright and ownership information
  • One to set GPS tags (through a series of dialogues)
  • And another to set data (titles, comments, keywords, etc.) based on metadata.

The thing is, these are only one-way ( to ). I already had some metadata (like photo titles) that I wanted to migrate into , so I tackled that problem first.

Prepping

Before I started doing my own tagger, I decided to store basic metadata on a text file with the same base name as my photos. Since all my photos are into a YYYYMMDDHHMMSS.jpg pattern, it was fairly simple to maintain and update as necessary (even if a bit wasteful of disk space).

To import them into the right fields, I set up a local replica of my online photo album and ran the following script to merge the titles (which I had stored in .txt files alongside the photos) into the Title tag and remove some of the comments I already had:

#!/bin/sh

for FILE in *.txt
do
  IMAGE=${FILE%%.*}.jpg
  echo $IMAGE
  TITLE=`cat $FILE | head -1`
  exiftool -overwrite_original_in_place -title="$TITLE" -comment="" $IMAGE
done

Then I held Option (Alt for those of you who only know modern-era keyboards) down as I launched to create a brand new photo library for my tagging endeavors, and started playing around with .

to

Since Andrew's scripts only add tags based on metadata, I had to reverse the process and set metadata based on the info.

Once I figured out how to output data as a tab-separated list of names and values, it was fairly easy to do:

copy (my selected_images()) to these_images
if these_images is false or (the count of these_images) is 0 then ¬
  error "Please select at least one image."

repeat with i from 1 to the count of these_images
  set the keywordslist to ""
  set this_photo to item i of these_images
  -- Grab the image file name
  tell this_photo
    set the image_file to the image path
  end tell
  -- Ask exiftool for tab-separated fields
  set output to do shell script ¬
    "exiftool -t '" & image_file & "'"
  -- Split output into a list of lines
  set output to the paragraphs of output
  -- Prepare to split lines by tabs
  set AppleScript's text item delimiters to {ASCII character 9}
  repeat with fields in output
    set fields to fields's text items
    if item 1 of fields is "Title" then
      tell this_photo
        set title to the item 2 of fields
      end tell
    else if item 1 of fields is "Comment" then
      tell this_photo
        set comment to the item 2 of fields
      end tell
    end if
  end repeat
end repeat

display dialog "Exif reading complete."

This needs some clean-up and optimization (invoking exiftool for each file sucks, and I'll have to add keyword parsing later as well), but did the trick for the initial import of 2100 photos into a new library - I just selected them all and ran the script, which took a while.

Paging Gurus

So far so good. But my goal is to have fully symmetrical import/export of all the metadata I need, and so far I've come across two major problems:

  • There seems to be no way in the dictionary to programmatically set GPS attributes via .
  • If I set them manually via the filesystem, refuses to load them from the updated files.

Rebuilding the library doesn't seem to work, either.

Any input on this is welcome, since there seems to be no documentation on the right technique to use, and oral tradition (i.e., ) has led me nowhere.

Other stuff I intend to do is:

  • Find a way to preserve rankings (maybe using keywords, but I'd rather use a standard / field).
  • Get GPS tagging to work by invoking via Safari and parsing the location URL (which is pretty easy to do).
  • Adding notifications and some sort of a decent progress bar.
  • Building an "export" script to archive my photos on a network drive and update a list of MD5 checksums.

...which probably means I will be playing around with Studio and build some sort of UI instead of relying on the Scripts menu.

Update: dth pointed me to Sven-S. Porst's approach to Geotagging, which includes a -aware script but which does not solve one of the fundamental issues - to get to refresh its own metadata and display the new GPS tags.

Keywords Galore

Now that I have a fancy UI to do all sorts of group tagging, titling, commenting and whatnot, all that's left is to use it, right?

Well, as it turns out, 's tagging UI sucks. I would even go as far as to consider it utterly useless, since it seems to have been designed for people who will seldom tag photos (if at all), and requires you to do two things I find positively stupid:

  • You have to invoke the Info panel and use the mouse to hit a checkbox per keyword (which means lots of scrolling around and clicking if you have more than a dozen keywords)
  • You can't define new keywords on the fly - You have to go back to Preferences to add them.

As it turns out, there is a ready fix for that - Ken Ferry's Keyword Assistant makes it relatively painless to do mass tagging and lets you define keywords on the fly.

Plus, with a bit of practice, you can use it almost exclusively via the keyboard.

Update: James Webster pointed out in the comments that there is now a Universal version of Keyword Assistant released the very day I posted this, so I revised the text below.

The first version I got had two catches, though: It wasn't a Universal binary (something that isn't important for me yet, but which bears noting) and it was written for 6.0.2, which means that it couldn't automatically add keywords to 6.0.4 unless you fired up your favorite hex editor and applied this fix.

I'm using the freshly updated Universal version now, and it is full of neat tricks (such as holding down modifier keys to remove tags). It has certainly saved me a lot of time, although my back-up plan was to add scripting actions to , which is something I will explore later (since it seems feasible to invoke , type in a bunch of keywords and have them applied to the current selection via ).

Now, Where Did I Shoot That?

I'm now going through the 2100 photos in this site's photo album, and to further ease the process I've created a set of smart folders to save me a lot of time:

  • A few smart folders for specific keywords that I know will be associated with many photos (Lisbon, Flowers, etc.)
  • A smart folder for photos with invalid titles (all untitled photos have their title set to their filename)
  • A smart folder for untagged photos (which is where I spend most of my time)

And yeah, it takes some doing. Fortunately, I can still remember where I shot most of my photos and in which context, but I've come across one or two fun puzzles already. It's odd to think that it all comes back to me when I browse the album at random, but that it sneaks around the corner of my mind when I'm actually trying to remember.

I can't wait to figure out how to set GPS tags properly - clicking around on a map is going to add an entire new dimension to this sort of mind game...

This page is referenced in: