Gramps Performance

3,255 bytes added, 08:24, 5 January 2017
{{stub}}Comparison of performance on large datasets between different Gramps versions.
==Performance tests==
It is important that Gramps performs well on datasets in the 10k to 30k range. A good benchmark is to test Gramps on a dataset in the 100k range and to keep track of performance with every new version.
Furthermore, this page can serve as proof to users that the present version of Gramps is '''not slow'''. From version 2.2.5 onwards, special attention will be given to performance, so that it does not deteriorate due to changes.
If you want to work with a large database, read [[Tips for large databases]].
==General setup==
To be fair, comparisons should use equal hardware and the same datasets. The optimal representation may be chosen, so for Gramps, tests are done in the native database format (GRDB) or in the Gramps XML format.
Should somebody want to publish results of commercial software under Windows, this is allowed, but the comparison should be fair: use the same hardware and dataset (so test on a dual-boot machine), and use the internal format of the program.
=== Genealogical datasets ===
{{man warn|Warning|<center style="font-size:110%">Private datasets will not be shared for any reason. <br><br>Free datasets are given under the following copyright: use for testing of genealogical programs only, no publication, no sharing. They have been created from free information on the net that its authors explicitly state can be used freely. <br><br>Should you nevertheless feel that certain data is misplaced, or that the original author does not have the right to distribute the data, please contact us so that any necessary information can be removed.</center>}}
'''FAQ'''
* ''My computer hangs on open, eating memory?'' These are LARGE datasets, so do NOT open them directly. For Gramps, open them as follows: create a new Family Tree, open it, then go to the import menu and import the dataset.
* ''What is tar.bz?'' This is a compression format. You must uncompress the file before importing it.
* ''Can you provide the GEDCOM?'' No. Offering GEDCOM has the danger of attracting too much traffic to this site. If you need GEDCOM, you should install Gramps, import the dataset, and then choose "Export to GEDCOM".
* ''What is in these files?'' See summary at the bottom of this page.
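The uncompress step from the FAQ can also be scripted. The following is a minimal sketch, not part of Gramps: the function name <code>extract_dataset</code> and the file names in the usage note are illustrative assumptions. It only relies on Python's standard <code>tarfile</code> module, which detects gzip and bzip2 compression by itself.

```python
import tarfile
from pathlib import Path


def extract_dataset(archive: str, dest: str = ".") -> list[str]:
    """Unpack a .tar.gz or .tar.bz2 dataset and return the member names.

    Hypothetical helper for illustration; it is not a Gramps function.
    """
    dest_path = Path(dest)
    dest_path.mkdir(parents=True, exist_ok=True)
    # Mode "r:*" lets tarfile detect the compression (gzip, bzip2, ...) itself.
    with tarfile.open(archive, mode="r:*") as tar:
        names = tar.getnames()
        if hasattr(tarfile, "data_filter"):
            # Newer Python versions offer a safer extraction filter.
            tar.extractall(dest_path, filter="data")
        else:
            tar.extractall(dest_path)
    return names
```

For example, <code>extract_dataset("testdb120000.gramps.tar.gz", "datasets")</code> would unpack a downloaded archive into a <code>datasets</code> directory (both names assumed here), after which the extracted file can be imported into a new, empty Family Tree.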
 
{| {{prettytable}}
|-
!Test Code
!File Name
!Download size
!People
!Size (MB)
!License
|-
|<!-- Code -->[[#Summary_of_database_test_d01|d01]]
|<!-- File Name -->Doug's test GEDCOM
|<!-- Download size --> -
|<!-- People --> 100993
|<!-- Size (MB) -->32MB
|<!-- License -->Private
|-
|<!-- Code -->[[#Summary_of_database_test_d02|d02]]
|<!-- File Name --><strike>testdb80000</strike>
|<!-- Download size --> 11.2MB
|<!-- People --> 82688
|<!-- Size (MB) -->70MB
|<!-- License -->Testing only, no sharing, no publication<br>{{man menu|*** NOTE: THIS FILE IS MISSING.<br>IF ANYONE HAS A COPY, PLEASE CONTACT nick@gramps-project.org ***}}
|-
|<!-- Code -->[[#Summary_of_database_test_d03|d03]]
|<!-- File Name -->[http://www.gramps-project.org/files/stresstestdata/testdb120000.gramps.tar.gz testdb120000]
|<!-- Download size --> 14.8MB
|<!-- People --> 124032
|<!-- Size (MB) -->88 MB
|<!-- License -->Testing only, no sharing, no publication
|-
|<!-- Code -->[[#Summary_of_database_test_d03|d03_alternate]]
|<!-- File Name -->[http://www.gramps-project.org/files/stresstestdata/test_2011-09-07.gramps.tar.bz2 test_2011-09-07.gramps]
|<!-- Download size --> 11.9MB
|<!-- People --> 124032
|<!-- Size (MB) -->88.4MB
|<!-- License -->Testing only, no sharing, no publication (d03 for Gramps 3.3.x)
|-
|<!-- Code -->[[#Summary_of_database_test_d04|d04]]
|<!-- File Name -->Jean-Raymond's test GEDCOM [http://forum.geneanet.org/index.php?topic=389170.0 french forum]
|<!-- Download size --> -
|<!-- People --> 52699
|<!-- Size (MB) -->13.6MB
|<!-- License -->Private
|-
|<!-- Code -->[[#Summary_of_database_test_d05|d05]]
|<!-- File Name -->[http://www.gramps-project.org/files/stresstestdata/testdb120000places.grampsplaces.tar.gz testdb120000gramps]
|<!-- Download size --> 2.5MB
|<!-- People --> 65598 place objects
|<!-- Size (MB) -->15.3MB
|<!-- License -->Testing only, no sharing, no publication
|-
|<!-- Code -->[[#Summary_of_database_test_d05|d06]] (same as d05, but gramps42 format)
|<!-- File Name -->[[Media:Places-2.gramps.zip]]
|<!-- Download size --> 2.8MB
|<!-- People --> 65598 place objects (expanded)
|<!-- Size (MB) -->22MB
|<!-- License -->Testing only, no sharing, no publication
|}
{| {{prettytable}}
|-
!Hardware Code
!Processor
!clock
!RAM
!Storage<!--Type eg: HDD or SSD-->
!OS
!User
|-
|H01 || Pentium 4 || 2.66 GHz || 512 MB || HDD || Linux || ?
|-
|H02 || ? || 1.7 GHz || 512 MB || HDD || Linux || ?
|-
|H03 || AMD Athlon64 X2 || 2x2.1 GHz || 1 GB || HDD || Kubuntu 6.06 || ?
|-
|H04 || Intel Centrino Duo || 2x1.66 GHz || 2 GB || HDD || Ubuntu 9.04 || [[User:Duncan]]
|-
|H05 || Intel Centrino Duo || 2x1.66 GHz || 2 GB || HDD || Ubuntu 8.10 || [[User:Duncan]]
|-
|H06 || AMD Phenom 9500 || Quad Core 2.2 GHz || 3GB || HDD || Windows Vista || Jean-Raymond Floquet
|-
|H07 || Intel Pentium 4 || 2.80 GHz || 512 MB * || HDD || Ubuntu 9.04 || [[User:Romjerome]]
|-
|H08 || Intel Celeron Dual Core || 2.60 GHz || 2 GB || HDD || Ubuntu 10.04 || [[User:Romjerome]]
|-
|H09 || Intel i5-2520M || 2.50 GHz || 8 GB || SSD || Ubuntu 14.04.3 || [[User:Sam888]]
|}
(*) + 80MB of swap used on import
=== Tests table legend ===
{| {{prettytable}}
|-
!Test Code !! Test Description
|-
|T01 || Time to import GEDCOM/GRAMPS in empty native file format (GRDB)
|-
|T01_a || Time to import GEDCOM/GRAMPS XML in empty native file format (BSDDB)
|-
|T02 || Size native file format (GRDB)
|-
|T03 || Time to open native file format (GRDB) for clean/non-clean start on people view (*)
|-
|T04 || Time to open edit person dialog
|-
|T07 || Sort on date in event view
|-
|T08 || Overall editing responsiveness
|}
(*) A clean start means the computer has been restarted, so Python methods/modules must also be loaded and started. A non-clean start means you have opened Gramps with the .grdb file before and open it again: parts of the data, as well as Python itself, will still be in memory, so access will be faster.
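The difference between a first run and repeated runs can be measured with a small timing harness. This is a rough sketch under stated assumptions: <code>time_runs</code> and <code>summarize</code> are hypothetical helper names, not Gramps functions, and the action to time must be supplied by the tester. A first run inside one session only approximates a clean start, since a true clean start requires a restart of the computer.

```python
import time
from typing import Callable


def time_runs(action: Callable[[], object], repeats: int = 3) -> list[float]:
    """Run `action` several times and return each wall-clock duration in seconds."""
    durations = []
    for _ in range(repeats):
        start = time.perf_counter()
        action()
        durations.append(time.perf_counter() - start)
    return durations


def summarize(durations: list[float]) -> dict[str, float]:
    """Report the first run separately from the fastest repeated run."""
    first = durations[0]
    # With a single measurement there is no repeat, so reuse the first value.
    best_repeat = min(durations[1:]) if len(durations) > 1 else first
    return {"first_run": first, "best_repeat": best_repeat}
```

A tester would pass a callable that performs the operation under test, for example <code>summarize(time_runs(my_open_tree))</code>, where <code>my_open_tree</code> is a placeholder for whatever opens the family tree. Reporting the first run apart from the best repeat keeps cache effects visible instead of averaging them away.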
=== Performance results ===
{{man warn|General remark:|Tests are done with the Gramps preference '''transactions enabled''', unless indicated otherwise with '''notrans'''. Enabling transactions gives a performance boost. ''For safety: only change this setting on an empty database. You have been warned!''}}
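The results tables record timings in a compact form such as <code>2h</code> or <code>4m17s</code>. To compare runs numerically, such values can be converted to seconds. The sketch below is only an illustration: the helper name <code>duration_to_seconds</code> is an assumption, and it handles only the <code>h</code>, <code>m</code> and <code>s</code> units seen on this page.

```python
import re

# Seconds per unit for the notation used in the results tables.
_UNIT_SECONDS = {"h": 3600, "m": 60, "s": 1}
_TOKEN = re.compile(r"(\d+)([hms])")
_WHOLE = re.compile(r"(?:\d+[hms])+")


def duration_to_seconds(text: str) -> int:
    """Convert a duration such as '4m17s' or '2h' to a number of seconds."""
    compact = text.replace(" ", "")
    if not _WHOLE.fullmatch(compact):
        raise ValueError(f"unrecognised duration: {text!r}")
    # Sum each number multiplied by the size of its unit.
    return sum(int(number) * _UNIT_SECONDS[unit]
               for number, unit in _TOKEN.findall(compact))
```

With this helper, <code>duration_to_seconds("4m17s")</code> gives 257 and <code>duration_to_seconds("2h")</code> gives 7200. Unknown values such as <code>?</code> raise <code>ValueError</code>, so missing measurements are not silently treated as zero.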
{| {{prettytable}}
|-
!Hardware Code !! Gramps !! data !! T01 !! T02
|-
|H03 ||bgcolor="#ffa0a0"| 2.2.4 notrans || d01 (xml)||bgcolor="#ffa0a0"| 2h || 542.6MB (v11)
|}
{| {{prettytable}}
|-
!Hardware Code !! data !! Gramps !! T03 !! T04 !! T05 !! T06 !! T07 !! T08 !! result
|-
|H02 || d01 ||bgcolor="#ffa0a0"| 2.2.4 || T03 = 4m17s || T04 = ? || T05 = ?/? || T06 = ? || T07 = ? || T08 = ||bgcolor="#ffa0a0"|
|}
== Dataset summaries ==
For every test dataset, create a summary with [[Gramps_4.2_Wiki_Manual_-_Reports_-_part_6#Database_Summary_Report|Database Summary Report]]:
=== Database Summary Reports ===
==== Summary of database test d01 ====
Number of individuals: 100993
Males: 53046
Unique surnames: 15308
==== Summary of database test d02 ====
Number of individuals: 82688
Males: 44736
Unique surnames: 13957
==== Summary of database test d03 ====
Number of individuals: 124032
Males: 67104
Unique surnames: 20695
==== Summary of database test d04 ====
Number of individuals: 52699
Males: 26420
Number of families: 24604
Unique surnames: 5822
 
==== Summary of database test d05 ====
Number of individuals: 2132
Number of families: 749
Number of events: 4981
Number of places: '''65598'''
Number of sources: 9
Number of media paths: 7
Number of repositories: 5
Number of notes: 1509
 
== User Stories ==
Running the tests can be slow, so here are some user testimonies about Gramps performance.
=== Robert 2012-10, version 3.3.1 ===
I currently work with a database of 141,000+ names without difficulty
(Gramps 3.3.1-1 on Fedora 16).
Initial start is fairly slow, though.
The first load of each view is slow, but subsequent visits to views are
almost immediate.
Initial view load times:
* people 11 to 12 secs
* relationship abt 7 secs
* family 3 to 4 secs
* events 7 to 8 secs
* places 3 to 4 secs
* notes 11 to 12 secs
* ancestry view abt 1 sec or less
* Media abt 2 secs (although I only have about 1000 media in database)
* Repositories almost immediate
* sources about 1 sec - (the time to select a source varies according to the number of references for that source - my worst case is a civil registry which has about twice as many references as people in my database).
== Possible Future Optimizations ==
One can fine-tune some things to obtain better results. An overview follows.
See if Gramps can pass this:
* [http://www.tamurajones.net/TheConfuciusChallenge.xhtml The Confucius Challenge]
** [http://www.tamurajones.net/ConfuciusCascade.xhtml Confucius Cascade] a real-world test consisting of increasingly gigantic GEDCOMs, with tough time limits.
** [http://www.tamurajones.net/ConfuciusCup2008.xhtml Confucius Cup 2008]
** [http://www.tamurajones.net/TwoHugeGEDCOMFiles.xhtml Two Huge GEDCOM Files]
** [http://www.tamurajones.net/GedFan0.4.0.0.xhtml GedFan] - creates GEDCOM files, so-called fan files, which are used to test genealogy applications and thus determine the capacity of those applications, expressed as a fan value.
[[Category:Developers/General]]
[[Category:Documentation|Performance]]
