Search Mailing List Archives

Limit search to: Subject & Body Subject Author
Sort by: Reverse Sort
Limit to: All This Week Last Week This Month Last Month
Select Date Range     through    

[liberationtech] Metadata Cleanup trough File Format Convertion?

Fabio Pietrosanti (naif) lists at
Wed Jul 17 09:28:39 PDT 2013

Hi all,

i've been thinking about the topic of metadata cleanup of files from an 
implementation point of view.

Regardless the consideration whether it's something useful or not for a 
Whistleblowing platform (GlobaLeaks), i've been considering whenever the 
"Metadata Cleanup" can't be approached by "File Format Conversion".

If i'd like to remove metadata from various documents formats (pdf, 
word, ppt, excel, etc) or image file, i've been thinking that rather 
then "explicitly removing metadata" a possible different approach would 
be by doing a "file convertion" .

If a JPEG is converted to PNG, "maybe" all metadatas are lost. (this has 
to be verified)
If a DOC/DOCX is converted to a PDF, maybe all metadatas are lost.

At GlobaLeaks we've been discussing about introducing "metadata cleanup" 
[1] , but also a "file sterilization" [2] with the goal to protect 
Receivers of a Whistleblowing site against targeted 0day attacks.

Should we approach "metadata cleanup" by doing the "file sterilization" 
processing trough existing Libreoffice convertion API [3] to save 
engineering effort/time?

[1] Metadata Cleanup
[2] File Sterilization
[3] Libreoffice Convertion API

Fabio Pietrosanti (naif)
HERMES - Center for Transparency and Digital Human Rights - -

More information about the liberationtech mailing list