<resource xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://datacite.org/schema/kernel-4" xsi:schemaLocation="http://datacite.org/schema/kernel-4 http://schema.datacite.org/meta/kernel-4.1/metadata.xsd"><identifier identifierType="DOI">10.48349/ASU/BOK3VO</identifier><creators><creator><creatorName nameType="Personal">Sluka, James P.</creatorName><givenName>James P.</givenName><familyName>Sluka</familyName><nameIdentifier nameIdentifierScheme="ORCID">0000-0002-5901-1404</nameIdentifier><affiliation>Indiana University Bloomington</affiliation></creator><creator><creatorName nameType="Personal">Zelinski, Mary B.</creatorName><givenName>Mary B.</givenName><familyName>Zelinski</familyName><nameIdentifier nameIdentifierScheme="ORCID">0000-0002-8513-1549</nameIdentifier><affiliation>Oregon Health &amp; Science University</affiliation></creator><creator><creatorName nameType="Personal">Watanabe, Karen H.</creatorName><givenName>Karen H.</givenName><familyName>Watanabe</familyName><nameIdentifier nameIdentifierScheme="ORCID">0000-0002-3572-6667</nameIdentifier><affiliation>Arizona State University</affiliation></creator><creator><creatorName nameType="Personal">Dietrich, Suzanne W.</creatorName><givenName>Suzanne W.</givenName><familyName>Dietrich</familyName><nameIdentifier nameIdentifierScheme="ORCID">0000-0003-3301-1227</nameIdentifier><affiliation>Arizona State University</affiliation></creator><creator><creatorName nameType="Personal">Riley Israels</creatorName><givenName>Riley</givenName><familyName>Israels</familyName><affiliation>Arizona State University</affiliation></creator></creators><titles><title>Replication Data for: Follicle Identification in Primate Ovaries via Machine Learning</title></titles><publisher>ASU Library Research Data Repository</publisher><publicationYear>2025</publicationYear><subjects><subject>Medicine, Health and Life Sciences</subject><subject schemeURI="https://www.nlm.nih.gov/mesh/meshhome.html" valueURI="http://purl.bioontology.org/ontology/MESH/D006080" subjectScheme="MeSH">Ovarian Follicle</subject><subject schemeURI="https://www.nlm.nih.gov/mesh/meshhome.html" valueURI="http://purl.bioontology.org/ontology/MESH/D001185" subjectScheme="MeSH">Artificial intelligence</subject><subject schemeURI="https://www.nlm.nih.gov/mesh/meshhome.html" valueURI="http://purl.bioontology.org/ontology/MESH/D000069550" subjectScheme="MeSH">Machine Learning</subject><subject schemeURI="https://www.nlm.nih.gov/mesh/meshhome.html" valueURI="http://purl.bioontology.org/ontology/MESH/D007091" subjectScheme="MeSH">Image Processing, Computer-Assisted</subject><subject schemeURI="https://www.nlm.nih.gov/mesh/meshhome.html" valueURI="http://purl.bioontology.org/ontology/MESH/D000098410" subjectScheme="MeSH">Transfer Machine Learning</subject><subject schemeURI="https://www.nlm.nih.gov/mesh/meshhome.html" subjectScheme="MeSH">Image Processing, Computer-Assisted</subject><subject schemeURI="https://www.nlm.nih.gov/mesh/meshhome.html" subjectScheme="MeSH">Ovarian Follicle</subject><subject schemeURI="http://www.informatics.jax.org/vocab/gene_ontology/GO:0001541" subjectScheme="Gene Ontology">Ovarian Follicle Development</subject><subject schemeURI="https://www.nlm.nih.gov/mesh/meshhome.html" subjectScheme="MeSH">Macaca mulatta</subject><subject schemeURI="https://www.nlm.nih.gov/mesh/meshhome.html" subjectScheme="MeSH">Macaca fuscata</subject><subject schemeURI="https://www.nlm.nih.gov/mesh/meshhome.html" subjectScheme="MeSH">Macaca fascicularis</subject><subject schemeURI="http://purl.obolibrary.org/obo/UBERON_0000474" subjectScheme="UBERON">Female Reproductive System</subject></subjects><contributors><contributor contributorType="ContactPerson"><contributorName nameType="Personal">Sluka, James P.</contributorName><givenName>James P.</givenName><familyName>Sluka</familyName><affiliation>Indiana University</affiliation></contributor><contributor contributorType="ContactPerson"><contributorName nameType="Personal">Karen Watanabe</contributorName><givenName>Karen</givenName><familyName>Watanabe</familyName><affiliation>Arizona State University</affiliation></contributor><contributor contributorType="Researcher"><contributorName nameType="Personal">Rao, Parth Ravindra</contributorName><givenName>Parth Ravindra</givenName><familyName>Rao</familyName></contributor><contributor contributorType="Researcher"><contributorName nameType="Personal">Nagda, Param</contributorName><givenName>Param</givenName><familyName>Nagda</familyName></contributor><contributor contributorType="Researcher"><contributorName nameType="Personal">Daniele, Alessia</contributorName><givenName>Alessia</givenName><familyName>Daniele</familyName></contributor><contributor contributorType="Researcher"><contributorName nameType="Personal">Jurado Gutierrez, Aleli</contributorName><givenName>Aleli</givenName><familyName>Jurado Gutierrez</familyName></contributor><contributor contributorType="Researcher"><contributorName nameType="Personal">Egusquiza Diaz, Eliany</contributorName><givenName>Eliany</givenName><familyName>Egusquiza Diaz</familyName></contributor><contributor contributorType="Researcher"><contributorName nameType="Personal">Villanueva, Edmundo</contributorName><givenName>Edmundo</givenName><familyName>Villanueva</familyName></contributor><contributor contributorType="Researcher"><contributorName nameType="Personal">Hernandez, Gabriella</contributorName><givenName>Gabriella</givenName><familyName>Hernandez</familyName></contributor><contributor contributorType="Researcher"><contributorName nameType="Personal">Shah, Gaurika</contributorName><givenName>Gaurika</givenName><familyName>Shah</familyName></contributor><contributor contributorType="Researcher"><contributorName nameType="Personal">Azooz, Masara</contributorName><givenName>Masara</givenName><familyName>Azooz</familyName></contributor><contributor contributorType="Researcher"><contributorName nameType="Personal">Ding, Yian</contributorName><givenName>Yian</givenName><familyName>Ding</familyName></contributor></contributors><dates><date dateType="Updated">2025-09-19</date><date dateType="Collected">2021-03-01/2025-02-28</date></dates><resourceType resourceTypeGeneral="Dataset">H &amp; E Histology images</resourceType><sizes><size>640665346</size><size>28297489830</size><size>21474836480</size><size>21474836480</size><size>21474836480</size><size>21474836480</size><size>21474836480</size><size>20769543349</size><size>4087291423</size><size>83615</size></sizes><formats><format>application/zip</format><format>application/zip</format><format>application/octet-stream</format><format>application/octet-stream</format><format>application/octet-stream</format><format>application/octet-stream</format><format>application/octet-stream</format><format>application/octet-stream</format><format>application/zip</format><format>application/pdf</format></formats><version>1.0</version><rightsList><rights rightsURI="info:eu-repo/semantics/openAccess"/><rights rightsURI="http://creativecommons.org/licenses/by-nc/4.0">CC BY-NC 4.0</rights></rightsList><descriptions><description descriptionType="Abstract">&lt;b>Overview:&lt;/b>
&lt;p>
The number and types of follicles present in the ovary are key indicators of the reproductive health and capacity in females. This data set contains annotated H&amp;E histology images from Rhesus (n=14), Cynomolgus (n=3) and Japanese (n=1) macaque (monkey) ovaries. The follicle images span the 6 preantral stages of primate ovarian follicle development: primordial, transitional primordial, primary, transitional primary, secondary, and multilayer. Follicle types were assigned by human experts. This data set is suitable for training machine learning algorithms to automatically identify and count follicles across these six developmental stages in ovarian histology images from non-human primates. In total, the dataset contains approximately 7,700 annotated follicles. These data were generated as part of the MOTHER-DB.org project.
&lt;p>
The data are partitioned across multiple zip archives, which are described in detail below. Within these zip files, the individual sub-images, which were extracted from full size histology images, are centered on a classified follicle and are 200 by 200 pixels (138 by 138 micrometer) in size. The source histology image, follicle type, and any manipulations on the sub-image, are encoded in the folder and sub-image file names. See 'README_FileNamingConventions' for details on interpreting the folder and filenames. The zip files include folders for each of the follicle classes. Note that the individual sub-image filenames also contain the follicle class. Therefore, if desired, you can combine all of the sub-images into a single folder without losing the follicle type assignments.
&lt;p>
&lt;b>The complete data set, “MOTHER_Macaque_Monkey_Preantral_Follicles.zip.00N”: &lt;/b>
&lt;p>
Within this Zip archive, individual images are partitioned in folders by follicle type and Train, Test and Validate subfolders used for training our machine learning algorithm. In addition, various image augmentations are included such as color inversion, image rotations, etc. Each annotation of a particular follicle generates a total of 48 augmentations. The set of 48 augmentations (which includes the original) for a particular annotation will always be in the same Train, Test or Validate folder. The data set also contains an extensive set of images representing non-follicle portions of the ovary. These images can be used as counter examples to the preantral follicle classifications sets. The image filenames identify the name of the full-size histology image, the follicle type, the location of the annotation in the full-size image and information about how it was augmented. The Train, Test, and Validate partition was done randomly to give partitions of 75:20:5. If desired, these three folders can be combined and the data repartitioned.
&lt;p>
In total, the &lt;b>dataset contains 1.7 million images&lt;/b> based on approximately 7,700 annotated follicles. &lt;b>This is a large dataset at ~120GB.&lt;/b> You need to download the entire set of zip archives with the “.zip.00N” extensions, where N is a digit from 1 to 6. Each zip file is about 20GB. &lt;b>A stable high-speed network is needed.&lt;/b> It will likely take several hours to download all six zip files.
&lt;p>
Zip software will reconstruct the complete zip archive if you open the first file in the series. We have tested unpacking these multipart zip files using The Unarchiver for Mac (https://the-unarchiver.macupdate.com/) and 7-Zip for Windows and Linux (https://www.7-zip.org/download.html).
&lt;p>
&lt;b>Smaller data set that omits the Negatives, “MOTHER_Macaque_Monkey_Preantral_Follicles_NoNegatives.zip”:&lt;/b>
 &lt;p>
This data set omits the “Negative” images and only contains the sub-images of annotated follicles and their augmentations. The zip file contains ~370K images based on ~7,700 annotated follicles and is about 20% as large as the complete data set described above. 
&lt;p>
&lt;b>Smallest data set that omits the Negatives and Augmentations,
“MOTHER_Macaque_Monkey_Preantral_Follicles_NoNegatives_NoAugmentations.zip”: &lt;/b>
&lt;p>
This data set omits the “Negatives” and all augmentation images and only contains the sub-images of annotated follicles. The zip file contains ~7,700 images, one for each of our expert-annotated follicles. 
&lt;p>
&lt;b>
Complete set of original histology images and annotations files, “MOTHER_TrainingData_HistoSlides_AnnotTables_20250812.zip”:&lt;/b>
 &lt;/p>
&lt;p>The “MOTHER_TrainingData_HistoSlides_AnnotTables_20250812.zip” file (3.8GB) contains paired full size histology images, and follicle annotation files. The follicle annotation files give the location and follicle type of every identified follicle in the image. All images in this dataset have a resolution of 0.69 micrometer/pixel and are in ome.tif format. The image files range in size from 130MB to 620MB each. The annotations files were output from QuPath, have a “.txt” extension, and are tab delimited text files. Each histology image file has an associated annotations file. For more information see the README_Training_Data_20250409.pdf file included in the zip file.
&lt;/p>
&lt;p>
&lt;b>README File Naming Conventions:&lt;/b>
&lt;/p>
&lt;p>
The "README_FileNamingConventions.pdf" contains a detailed description of the naming conventions used for the folders and sub-image file names. The filenames contain all the information needed to identify the assigned follicle type, the original histology slide it was derived from, and any augmentation details.</description><description descriptionType="Methods">&lt;p>&lt;b>Original ovary histology images&lt;/b> are available at &lt;a href="https://mother-db.org">https://mother-db.org&lt;/a>&lt;/p>

&lt;p>&lt;b>Python code&lt;/b> to generate the individual follicle subimages from annotated ovary histology sections is available in the MOTHER GitHub repository. See &lt;a href="https://github.com/mother-db/MOTHER-DB-annotation-tools">https://github.com/mother-db/MOTHER-DB-annotation-tools&lt;/a>&lt;/p></description></descriptions><geoLocations/><fundingReferences><fundingReference><funderName>National Science Foundatoin</funderName><awardNumber>NSF DBI--2054061</awardNumber></fundingReference><fundingReference><funderName>National Institutes of Health</funderName><awardNumber>P51 OD011092</awardNumber></fundingReference></fundingReferences></resource>