Solr Index Fields
A list of the fields defined in the Solr search index used by the Coordinating Nodes.
These fields are populated by the index processor using values drawn from
Types.SystemMetadata, Science Metadata, and Resource Map documents.
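As an illustration of how these fields are used in a search, the following minimal sketch builds a Solr query URL that filters on one field and returns a few others. The endpoint URL and the field names `formatType`, `id`, `title`, and `dateUploaded` are assumptions made for this example (the field-name column is not reproduced in the tables below); check them against the live schema before relying on them.

```python
from urllib.parse import urlencode

# Assumed query endpoint; substitute the Coordinating Node base URL you use.
SOLR_QUERY_URL = "https://cn.dataone.org/cn/v2/query/solr/"


def build_query_url(q, fields, rows=10, base_url=SOLR_QUERY_URL):
    """Return a Solr query URL for `q` that asks only for the listed fields."""
    params = {
        "q": q,                   # standard Solr query parameter
        "fl": ",".join(fields),   # field list: only stored fields can be returned
        "rows": rows,             # maximum number of documents to return
        "wt": "json",             # response format
    }
    return base_url + "?" + urlencode(params)


# Field names here are assumptions for illustration only.
url = build_query_url("formatType:METADATA", ["id", "title", "dateUploaded"], rows=5)
print(url)
```

The sketch only constructs the URL and makes no network request, so it can be adapted to whichever HTTP client is already in use.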
Note
For Editors
Definitions are drawn from the Solr configuration file, and the description of each
field is kept in a separate properties file
(dataone-cn-solr/usr/share/dataone-cn-solr/debian/queryFieldDescriptions.properties).
After editing descriptions, the document source must be regenerated and committed to
GitHub for the public-facing documentation to be updated.
Static Fields
Field | Type | MV | Store | Index | Description |
---|---|---|---|---|---|
|
string | False | False | True | |
|
long | False | True | True | |
|
text_general | False | True | True | The full text of the abstract as provided in the science metadata document. |
|
text_general | True | True | True | Multi-valued field combining the text from the attributeName, attributeLabel, attributeDescription, and attributeUnit fields into a single searchable text field. |
|
text_general | True | True | True | Multi-valued field containing the attribute descriptive text. |
|
string | True | True | True | Multi-valued field containing secondary attribute name information. |
|
string | True | True | True | Multi-valued field containing the main attribute name information. |
|
string | True | True | True | Multi-valued field containing the attribute unit information. |
|
string | False | True | True | Principal Investigator (PI) / Author as listed in the metadata document. |
|
string | False | True | True | The given name of the primary author/PI. |
|
alphaOnlySort | False | True | True | The given name of the primary author/PI, case-normalized for sorting. |
|
string | False | True | True | The node Id of the authoritative Member Node for the object. |
|
string | True | True | True | The last name(s) of the author(s). |
|
string | False | True | True | The surname of the primary author/PI. |
|
alphaOnlySort | False | True | True | The surname of the primary author/PI, case-normalized for sorting. |
|
string | True | True | True | |
|
string | True | True | True | |
|
string | True | True | True | |
|
string | True | True | True | |
|
tdate | False | True | True | The starting date of the temporal range of the content described by the metadata document. |
|
string | True | True | False | A multi-valued field that contains the node Ids of member nodes that are blocked from holding replicas of this object. |
|
string | True | True | True | List of subjects (groups and individuals) that have change permission on PID. |
|
string | False | True | False | The checksum for the object |
|
string | False | True | False | Algorithm used for generating the object checksum |
|
string | True | True | True | Taxonomic class name(s) |
|
string | True | True | True | Name of the organization to contact for more information about the dataset |
|
text_general | True | False | True | Copy from contactOrganization |
|
string | False | True | True | The node Id of the member node that originally contributed the content. |
|
string | False | True | False | The URL that can be used to resolve the location of the object given its PID. |
|
tdate | False | True | True | The date and time when the object system metadata was last updated. |
|
tdate | False | True | True | Publication date for the dataset (this may or may not be coincident with when the content is added to DataONE). |
|
tdate | False | True | True | The date and time when the object was uploaded to the Member Node. |
|
string | False | True | True | The latest decade that is covered by the dataset, expressed in the form “1999-2009” |
|
string | True | True | True | Lists all PIDs that this object describes. Obtained by parsing all resource maps in which this object is referenced. Not set for data or resource map objects. |
|
tfloat | False | True | True | Easternmost longitude of the spatial extent, in decimal degrees, WGS84 |
|
text_general | False | True | True | The version or edition number of the item described. |
|
tdate | False | True | True | The ending date of the temporal range of the content described by the metadata document. |
|
string | True | True | True | Taxonomic family name(s) |
|
string | False | True | True | Contains the CNRead.resolve URL for the object ONLY if the object is a science metadata object. |
|
string | False | True | True | The file name for the object, specified in system metadata field with the same name. |
|
string | False | True | True | The format identifier indicating the type of content this record refers to. |
|
string | False | True | True | The format type of the record - DATA, METADATA, RESOURCE. |
|
string | True | True | True | |
|
string | True | True | True | |
|
string | True | True | True | |
|
string | True | True | True | |
|
string | True | True | True | |
|
string | True | True | True | |
|
text_general | True | False | True | |
|
text_general | True | False | True | |
|
text_general | True | True | True | Keywords drawn from the GCMD controlled vocabulary |
|
string | True | True | True | Taxonomic genus name(s) |
|
string | False | True | True | The name of the general form in which the item’s geospatial data is presented |
|
text_general | True | True | True | An encoded string that represents the geographic coordinates of the centroid of a spatial extent. This can be used for searching and plotting. |
|
text_general | True | True | True | An encoded string that represents the geographic coordinates of the centroid of a spatial extent. This can be used for searching and plotting. |
|
text_general | True | True | True | An encoded string that represents the geographic coordinates of the centroid of a spatial extent. This can be used for searching and plotting. |
|
text_general | True | True | True | An encoded string that represents the geographic coordinates of the centroid of a spatial extent. This can be used for searching and plotting. |
|
text_general | True | True | True | An encoded string that represents the geographic coordinates of the centroid of a spatial extent. This can be used for searching and plotting. |
|
text_general | True | True | True | An encoded string that represents the geographic coordinates of the centroid of a spatial extent. This can be used for searching and plotting. |
|
text_general | True | True | True | An encoded string that represents the geographic coordinates of the centroid of a spatial extent. This can be used for searching and plotting. |
|
text_general | True | True | True | An encoded string that represents the geographic coordinates of the centroid of a spatial extent. This can be used for searching and plotting. |
|
text_general | True | True | True | An encoded string that represents the geographic coordinates of the centroid of a spatial extent. This can be used for searching and plotting. |
|
string | False | True | True | The identifier of the object being indexed. |
|
text_general | False | True | True | Copy from id. |
|
string | True | True | True | Name of the investigator(s) responsible for developing the dataset and associated content. |
|
text_general | True | False | True | Copy from investigator. |
|
string | True | True | True | Lists all PIDs that describe this object. Obtained by parsing all resource maps in which this object is referenced. |
|
boolean | False | True | True | Set to True if the DataONE public user is present in the list of subjects with readPermission on PID. |
|
boolean | False | True | True | Set to true if the document is a member node service description document. Use to filter search results to exclude or include member node services. |
|
string | False | True | True | Set to “Y” for records that contain spatial information |
|
string | True | True | True | Terms drawn from a controlled vocabulary of concepts that are applicable to the content described by the metadata document. |
|
string | True | True | True | Keywords recorded in the science metadata document. These may be controlled by the generator of the metadata or by the metadata standard of the document, but are effectively uncontrolled within the DataONE context. |
|
text_general | True | False | True | Copy from keywords |
|
string | True | True | True | Taxonomic kingdom(s) |
|
string | False | True | True | Data provider organization identifier, for sources within the LTER network. |
|
string | False | True | True | The name attribute of the media type element in system metadata. Indicates media type of the object. |
|
string | True | True | True | A list of properties describing the media type in system metadata. The value is a concatenation of the property elements name attribute and the value of the property element. |
|
string | True | True | True | The name of the location(s) relevant to the content described by the metadata document. |
|
string | False | True | True | Set to “Y” if there is no bounding box information available (i.e., the east, west, north, south most coordinates) |
|
tfloat | False | True | True | Northernmost latitude of the spatial extent, in decimal degrees, WGS84 |
|
int | False | True | False | Requested number of replicas for the object |
|
string | False | True | True | If set, indicates the object that replaces this record. |
|
string | False | True | True | If set, indicates the object that this record obsoletes. |
|
text_general | False | True | False | URL for Open Geospatial Web service if available. |
|
string | True | True | True | Taxonomic order name(s) |
|
string | True | True | True | Investigator or Investigator organization name. |
|
string | True | True | True | Investigator or Investigator organization name. Derived by normalizing origin. |
|
text_general | True | False | True | |
|
text_general | True | False | True | Copy from origin |
|
string | True | True | True | A characteristic, or variable, that is measured or derived as part of data-collection activities. |
|
text_general | True | False | True | Copy from parameter |
|
string | True | True | True | Taxonomic phylum (or division) name(s) |
|
text_general | True | True | True | A place name keyword, assigned by the metadata creator. It is one keyword from the thesaurus named in <placekt> |
|
string | True | True | False | A list of member node identifiers that are preferred replication targets for this object. |
|
string | False | True | True | Type of data being preserved (maps, text, etc.) |
|
string | False | True | True | The authorized name of a research effort for which data is collected. This name is often reduced to a convenient abbreviation or acronym. All investigators involved in a project should use a common, agreed-upon name. |
|
text_general | False | False | True | Copy from project |
|
string | True | True | True | A multi-valued field containing the identifiers of data objects that this program generated based on the PROV wasGeneratedBy, qualifiedAssociation, and hadPlan properties. |
|
string | True | True | True | A multi-valued field containing the identifiers of the executions that this data object was generated by based on the PROV wasGeneratedBy property. |
|
string | True | True | True | A multi-valued field containing the identifiers of the programs that this data object was generated by based on the PROV wasGeneratedBy, qualifiedAssociation, and hadPlan properties. |
|
string | True | True | True | A multi-valued field containing the identifiers of the users that this data object was generated by based on the PROV wasGeneratedBy, qualifiedAssociation, and agent properties. |
|
string | True | True | True | A multi-valued field containing the identifiers of the data objects that were derivations of the source data object described by this metadata object, based on the PROV wasDerivedBy property. |
|
string | True | True | True | A multi-valued field containing the identifiers of the data objects that were sources to the derived data object described by this metadata object, based on the PROV wasDerivedBy property. |
|
string | True | True | True | A multi-valued field containing the identifiers of the semantic classes that this object is an instance of, based on the PROV, ProvONE, and other ontologies. |
|
string | True | True | True | A multi-valued field containing the identifiers of data objects that this program used based on the PROV used, qualifiedAssociation, and hadPlan properties. |
|
string | True | True | True | A multi-valued field containing the identifiers of the executions that used this data object based on the PROV used property. |
|
string | True | True | True | A multi-valued field containing the identifiers of the programs that used this data object based on the PROV used, qualifiedAssociation, and hadPlan properties. |
|
string | True | True | True | A multi-valued field containing the identifiers of the users that used this data object based on the PROV used, qualifiedAssociation, and agent properties. |
|
string | True | True | True | A multi-valued field containing the identifiers of data objects that this data object was derived from based on the PROV wasDerivedBy property. |
|
string | True | True | True | A multi-valued field containing the identifiers of the executions that used this program based on the PROV qualifiedAssociation, and hadPlan properties. |
|
string | True | True | True | A multi-valued field containing the identifiers of the users that executed this program based on the PROV qualifiedAssociation, hadPlan, and agent properties. |
|
string | True | True | True | A multi-valued field containing the identifiers of executions that this execution was informed by based on the PROV wasInformedBy property. |
|
tdate | False | True | True | Publication date for the dataset (this may or may not be coincident with when the content is added to DataONE). |
|
text_general | False | True | True | The “Purpose” describes the “why” aspects of the data set (For example, why was the data set created?). |
|
string | True | True | True | List of subjects (groups and individuals) that have read permission on PID. |
|
string | True | True | True | |
|
string | True | True | True | One or more node Ids holding copies of the object. |
|
boolean | False | True | False | True if this object can be replicated. |
|
tdate | True | True | False | |
|
string | True | True | True | List of resource map PIDs that reference this PID. |
|
string | False | True | True | The Subject that acts as the rights holder for the object. |
|
string | True | True | True | Taxonomic scientific name(s) at the most precise level available for the organisms of relevance to the dataset |
|
string | True | True | True | |
|
string | True | True | True | |
|
string | True | True | True | |
|
string | True | True | True | |
|
string | True | True | True | Also called “instrument.” A device that is used for collecting data for a data set. |
|
text_general | True | False | True | Copy from sensor. |
|
string | False | True | True | The seriesId is an optional, unique Unicode string that identifies an object revision chain. |
|
string | False | True | True | Either ‘tight’, ‘mixed’, or ‘loose’. A tightly coupled service works only on the data described by this metadata document. Loose coupling means the service works on any data. Mixed coupling means the service works on data described by this metadata document but may also work on other data. |
|
text_general | False | True | True | A human readable description of the member node service to assist discovery and to evaluate applicability. |
|
string | True | True | True | A URL that indicates how to access the member node service. |
|
string | True | True | True | Aspect of the service that accepts a digital entity. Either a list of DataONE formatId URLs or PID RESOLVE URLs that the member node service operates on. A PID RESOLVE URL indicates a ‘tight’ coupled service, while a list of formatIds indicates a ‘loose’ coupled service. |
|
string | True | True | True | Aspect of the service that provides a digital entity resulting from operation of the service. A list of the DataONE formatIds that this member node service produces. |
|
text_general | False | True | True | A brief, human readable descriptive title for the member node service. |
|
string | True | True | True | The type of service being provided by the member node. |
|
string | True | True | True | The name or description of the physical location where the data were collected |
|
text_general | True | False | True | Copy from site. |
|
tlong | False | True | True | The size of the object, in bytes. |
|
string | True | True | True | Also called “platform.” The mechanism used to support the sensor or instrument that gathers data |
|
text_general | True | False | True | Copy from source. |
|
tfloat | False | True | True | Southernmost latitude of the spatial extent, in decimal degrees, WGS84 |
|
string | True | True | True | Taxonomic species name(s) |
|
string | False | True | True | The Subject name of the original submitter of the content to DataONE. |
|
string | True | True | True | A secondary subject area within which parameters can be categorized. Approved terms include “agricultural chemicals” and “atmospheric chemistry,” among many others. When entering a term in the LandVal Metadata Editor, users should select a standard expression from the pick list for terms if at all possible. |
|
text_general | True | False | True | Copy from term. |
|
text_en_splitting | False | True | True | Full text of the metadata record, used to support full text searches |
|
text_general | False | True | True | Title of the dataset as recorded in the science metadata. |
|
string | False | False | True | Copy from title. |
|
string | True | True | True | The most general subject area within which a parameter is categorized. Approved topics include “agriculture,” “atmosphere,” and “hydrosphere,” among others. |
|
text_general | True | False | True | Copy from topic. |
|
tdate | False | True | True | Copy from dateuploaded. |
|
string | True | True | False | Link to the investigator’s web-site. |
|
tfloat | False | True | True | Westernmost longitude of the spatial extent, in decimal degrees, WGS84 |
|
string | True | True | True | List of subjects (groups and individuals) that have write permission on PID. |
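The MV, Store, and Index columns determine how a field can be used: in Solr, an indexed field can appear in a query or filter, and a stored field can be returned in results. The sketch below models three rows of the table to show the distinction. The field names are placeholders invented for this example, since the name column is not reproduced above; only the type and flag values are copied from the table.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FieldDef:
    """One row of the field table: name, Solr type, and the MV/Store/Index flags."""
    name: str
    type: str
    multi_valued: bool
    stored: bool
    indexed: bool


# Placeholder names (hypothetical); flags copied from rows in the table above.
FIELDS = [
    FieldDef("checksum_example", "string", False, True, False),           # stored, not indexed
    FieldDef("keywords_text_example", "text_general", True, False, True),  # indexed copy, not stored
    FieldDef("size_example", "tlong", False, True, True),                  # stored and indexed
]


def split_by_use(fields):
    """Return (searchable, returnable) lists of field names."""
    searchable = [f.name for f in fields if f.indexed]
    returnable = [f.name for f in fields if f.stored]
    return searchable, returnable


searchable, returnable = split_by_use(FIELDS)
print("can be queried:", searchable)
print("can be returned:", returnable)
```

Under this reading, the "Copy from ..." rows with Store set to False exist to support text search only, and the original field is the one to request when the value itself is needed.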
Dynamic Fields
Field | Type | MV | Store | Index | Description |
---|---|---|---|---|---|
|
boolean | False | True | True | |
|
boolean | True | True | True | |
|
boolean | True | True | True | |
|
currency | False | True | True | |
|
tdouble | False | False | True | |
|
double | False | True | True | |
|
tdouble | True | True | True | |
|
double | True | True | True | |
|
date | False | True | True | |
|
tdate | True | True | True | |
|
date | True | True | True | |
|
text_en | True | True | True | |
|
float | False | True | True | |
|
tfloat | True | True | True | |
|
float | True | True | True | |
|
int | False | True | True | |
|
tint | True | True | True | |
|
int | True | True | True | |
|
long | False | True | True | |
|
tlong | True | True | True | |
|
long | True | True | True | |
|
location | False | True | True | |
|
string | False | True | True | |
|
string | True | True | True | |
|
string | True | True | True | |
|
text_general | False | True | True | |
|
tdouble | False | True | True | |
|
tdate | False | True | True | |
|
tfloat | False | True | True | |
|
tint | False | True | True | |
|
tlong | False | True | True | |
|
text_general | True | True | True | |
|
text_general | True | True | True | |
|
text_general | True | True | True | |
|
ignored | True | False | False | |
|
random | False | False | False |
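Dynamic fields are matched by name pattern rather than by exact name: a Solr dynamic field pattern carries a wildcard at its start or end, and any field whose name fits the pattern takes the declared type. The patterns are not reproduced in the table above, so the ones in this sketch (`*_s`, `*_i`, `ignored_*`, `random_*`) are assumptions based on common Solr conventions, paired with types that do appear in the table. Returning the first match in declaration order is a simplification for illustration, not a statement of how Solr resolves overlapping patterns.

```python
# Hypothetical pattern-to-type mapping; the types appear in the table above,
# but the patterns themselves are assumed for this example.
DYNAMIC_FIELDS = {
    "*_s": "string",
    "*_i": "int",
    "ignored_*": "ignored",
    "random_*": "random",
}


def pattern_matches(pattern, name):
    """Match a pattern that has a single '*' at its start or end."""
    if pattern.startswith("*"):
        return name.endswith(pattern[1:])
    if pattern.endswith("*"):
        return name.startswith(pattern[:-1])
    return pattern == name


def resolve_type(name, patterns=DYNAMIC_FIELDS):
    """Return the type of the first pattern matching `name`, or None."""
    for pattern, field_type in patterns.items():
        if pattern_matches(pattern, name):
            return field_type
    return None


print(resolve_type("project_s"))
print(resolve_type("ignored_extra"))
print(resolve_type("title"))
```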