gmailparser

Info

Occasionally you will get Gmail artefacts in the web browser cache, so this console application will open each *.htm and *.html file within a given directory (and sub directories) parse out the message thread information e.g.

  • Message id
  • Email Address
  • Subject
  • First 50 characters of email
  • Attachments
  • Message Timestamp

There is a SANS article that explains the Gmail data:  http://sansforensics.wordpress.com/2008/09/19/forensic-gmail-artifact-analysis/

Basically the data is stored in the webpage as JSON (JS Object Notation) which is used to transmit data in AJAX web applications. JSON is a less verbose way than XML hence why it is used in high-performance web sites rather than XML to reduce the amount of data being transmitted.

The Gmail JSON uses a character to denote the data stored e.g. "t" = "Message List", "gn" = Account Name. The application uses an open-source JSON library for .NET called...wait for it...JSON.NET (http://www.codeplex.com/Json)

It use the application simply get a command prompt up and give it the pass to the web browser cache and it will output a CSV file called "gmailparser.csv" in the given directory. To change the output path, simply supply the output path as a second parameter e.g.

gmailparser.exe "C:\Internet Cache" "C:\Output\woanware.csv"

Requirements

  • Microsoft .Net 3.5 or Mono v2+ (Maybe)

Options:

Size

Colors