Difference between revisions of "GEDCOM"

From Gramps
Jump to: navigation, search
(GEDCOM Testing)
(GEDCOM Testing)
Line 8: Line 8:
==GEDCOM Testing==
==GEDCOM Testing==
* [http://pjcj.sytes.net/pjcj.net/html/perl.html#Gedcom.pm Gedcom.pm]]
* [http://pjcj.sytes.net/pjcj.net/html/perl.html#Gedcom.pm Gedcom.pm]
* [http://www.genindre.org/ftp/perl/gedchk.zip gedchk (perl)]]
* [http://www.genindre.org/ftp/perl/gedchk.zip gedchk (perl)], [http://www.familysearch.org/GEDCOM/gedcheck.zip gedchk.exe]
* [http://www.familysearch.org/GEDCOM/gedcheck.zip gedchk.exe]
* [http://www.geditcom.com/gedcom.html GEDitCOM "Torture Test" files]
* [http://www.geditcom.com/gedcom.html GEDitCOM "Torture Test" files]
* [http://heiner-eichmann.de/gedcom/charset.htm ANSEL and Unicode GEDCOM sample files]
* [http://heiner-eichmann.de/gedcom/charset.htm ANSEL and Unicode GEDCOM sample files]

Revision as of 19:02, 6 January 2009

GEDCOM stands for Genealogical Data Communications and is a file format specification developed by the Family and Church History Department of The Church of Jesus Christ of Latter-day Saints. It allows different genealogical software programs to share data with each other.

GEDCOM References

  • GEDCOM 5.5 (Envoy, pdf, html)
  • Gedcom.pm
  • GEDCOM 5.5.1 draft (pdf)
  • GEDCOM 6.0 beta (pdf) - Note - this effort appears to have been abandoned

GEDCOM Testing

Converting ANSEL to Unicode and back

  • GRAMPS exports and imports ANSEL GEDCOM, but not all codes are supported. Test the output on the character ? which is used if a character is not understood
  • Perl MARC::Charset module: Available on cpan, and well supported . Can convert MARC8 to Unicode and back. As ANSEL is a subset of MARC8 (consisting of the (extended) latin characters) it will convert all ANSEL GEDCOMS to Unicode. (:arrow:) More info, and a script that can convert: see attachment to bug 831.
  • marc4j - Java implementation of a convertor, also MARC8 to Unicode. Unclear if it converts correctly combined unicode characters.

ANSEL/MARC References

Unofficial ANSEL to Unicode conversion lists

These cover the original ANSEL spec, now preseded by MARC8.

See also