Difference between revisions of "GEDCOM"

From Gramps
Jump to: navigation, search
m (See also)
m (See also)
(7 intermediate revisions by 4 users not shown)
Line 14: Line 14:
 
* [http://heiner-eichmann.de/gedcom/charset.htm ANSEL and Unicode GEDCOM sample files]
 
* [http://heiner-eichmann.de/gedcom/charset.htm ANSEL and Unicode GEDCOM sample files]
 
* [http://www.familyhistoriansoftware.co.uk/compare/gedcom-test.htm Calico Pie's GEDCOM Coverage Test]
 
* [http://www.familyhistoriansoftware.co.uk/compare/gedcom-test.htm Calico Pie's GEDCOM Coverage Test]
 +
* [http://ancestorsnow.blogspot.com/2011/07/vged.html VGed] (Windows OS)
 +
* [http://www.tamurajones.net/GEDCOMValidation.xhtml Gedcom Validation] by Tamura Jones.
  
 
==Converting ANSEL to Unicode and back==
 
==Converting ANSEL to Unicode and back==
* GRAMPS exports and imports ANSEL GEDCOM, but not all codes are supported. Test the output on the character ? which is used if a character is not understood
+
* Gramps exports and imports ANSEL GEDCOM, but not all codes are supported. Test the output on the character ? which is used if a character is not understood
* Perl MARC::Charset module: Available on  [http://search.cpan.org/~esummers/MARC-Charset-0.95/lib/MARC/Charset.pm cpan], and well supported . Can convert MARC8 to Unicode and back. As ANSEL is a subset of MARC8 (consisting of the (extended) latin characters) it will convert all ANSEL GEDCOMS to Unicode. (:arrow:) More info, and a script that can convert: see [http://bugs.gramps-project.org/view.php?id=831 attachment to bug 831].
+
* Perl MARC::Charset module: Available on  [http://search.cpan.org/~esummers/MARC-Charset-0.95/lib/MARC/Charset.pm cpan], and well supported . Can convert MARC8 to Unicode and back. As ANSEL is a subset of MARC8 (consisting of the (extended) latin characters) it will convert all ANSEL GEDCOMS to Unicode. (:arrow:) More info, and a script that can convert: see the attachment to bug {{bug|831}}.
 
* [http://marc4j.tigris.org/ marc4j] - Java implementation of a convertor, also MARC8 to Unicode. Unclear if it converts correctly combined unicode characters.
 
* [http://marc4j.tigris.org/ marc4j] - Java implementation of a convertor, also MARC8 to Unicode. Unclear if it converts correctly combined unicode characters.
  
Line 33: Line 35:
  
 
==See also==
 
==See also==
* [[GRAMPS XML]]
+
* [[Gramps XML]]
* [[Import from another program]]
+
* [[Import from another genealogy program]]
 
* [[Gramps and GEDCOM]]
 
* [[Gramps and GEDCOM]]
 +
* [[GEDCOM Extensions]]
 +
* [[Other_genealogy_tools#Gedcom_one-to-one_to_XML|ged1212xml]]
  
 
[[Category:Developers/Reference]]
 
[[Category:Developers/Reference]]
 
[[Category:Developers/General]]
 
[[Category:Developers/General]]

Revision as of 15:20, 7 June 2013

GEDCOM stands for Genealogical Data Communications and is a file format specification developed by the Family and Church History Department of The Church of Jesus Christ of Latter-day Saints. It allows different genealogical software programs to share data with each other.

GEDCOM References

GEDCOM Testing

Converting ANSEL to Unicode and back

  • Gramps exports and imports ANSEL GEDCOM, but not all codes are supported. Test the output on the character ? which is used if a character is not understood
  • Perl MARC::Charset module: Available on cpan, and well supported . Can convert MARC8 to Unicode and back. As ANSEL is a subset of MARC8 (consisting of the (extended) latin characters) it will convert all ANSEL GEDCOMS to Unicode. (:arrow:) More info, and a script that can convert: see the attachment to bug 831.
  • marc4j - Java implementation of a convertor, also MARC8 to Unicode. Unclear if it converts correctly combined unicode characters.

ANSEL/MARC References

Unofficial ANSEL to Unicode conversion lists

These cover the original ANSEL spec, now preseded by MARC8.

See also