## page was renamed from ICARUS-Search-Perspective = ICARUS-Search-Perspective = The {{attachment:search_perspective.png}} perspective provides the following search types: * Dependency-Search * Coreference-Documents * Error Mining for Part-Of-Speech Tags * Error Mining for Dependency Structure [[ICARUS|{{attachment:navi_up.png|Back to ICARUS Main Page}}]][[extern/ICARUS|Back to ICARUS Main Page]] <> == Index: == I. [[#how-to-newsearch|How to set up a new search]] I. [[#search-menu|Search Menu]] I. [[#result-outline|Result Outline]] I. [[#dependency-search|Dependency-Search:]] i. [[#search-parameter-dependency|Search Parameter (Dependency-Search)]] i. [[#graph-query-editor|Graph Query Editor (Dependency-Search)]] i. [[#result-outline-dependency|Result Outline (Dependency-Search)]] I. [[#error_mining|Error Mining:]] i. [[#search-parameter-errormining|Search Parameter (Error Mining)]] i. [[#errormining-query-editor|Error Mining Query Editor]] i. [[#result-outline-errormining|Result Outline (Error Mining)]] I. [[#tutorials|Tutorials (including videos):]] i. [[#tutorial-1D|Tutorial Dependency Search (passive constructions) with one grouping operator]] i. [[#tutorial-2D|Tutorial Dependency Search (passive constructions with overt logical subjects)]] i. [[#tutorial-3D|Tutorial Dependency Search (passive constructions with overt logical subjects and object)]] i. [[#tutorial-em|Tutorial Error Mining]] <> === I. How to set up a new search: === 1. Click on {{attachment:search_new.png}} to create a new search. 1. Afterwards the search need to be configured: 1. {{attachment:search_configuration.png||align="top"}} * Type: Select the desired search mode (dependency, error mining, coreference,...) * Data-Set: Select the Treebank/Document * Query: Clicking {{attachment:search_query.png}} opens the query editor. There may be different types of query editors depending on the search type. * Parameters: Search pararameters depending on the search type. 1. Execute Search using the {{attachment:search_execute.png}} button 1. View the Result by double-clicking the search result or use the inspect-button {{attachment:search_inspect.png}} [[#icarus-search-index|{{attachment:navi_up.png|Back to index}}]][[#icarus-search-index|Back to index]] <> === II. Search Menu: === {{attachment:search_manager_menu.png}} * {{attachment:search_preferences.png}} = Open the preferences * {{attachment:search_new.png}} = Create a new search * {{attachment:search_execute.png}} = Executes the search. Note if no data-set was set the button is disabled {{attachment:search_execute_inactive.png}} '''Search History Toolbar:''' {{attachment:search_history-tb.png}} . Every executed search is listed in the search history. The history is available until you close your ICARUS session. The figure shows three search history items. During the search process the icons to the left may change: * {{attachment:search_runing+loading.png|search_runing+loading.png}} Search is active (first icon) but the target data-set is not loaded yet (second icon) * {{attachment:search_runing+loaded.png|search_runing+loaded.png}} Search is active (first icon) and target data-set is loaded (second icon) * {{attachment:search_finished+loaded.png|search_finished+loaded.png}} Search finished successful (first icon) and target data-set is loaded (second icon) * {{attachment:search_icon_error.png}} Search was not successful. * {{attachment:search_clearhistory.png}} = Clear all search history items * {{attachment:search_remove1search.png}} = Remove the selected search result from the history * {{attachment:search_viewquery.png}} = Display the query of the selected search * {{attachment:search_inspect.png}} = Display the result of selected search * {{attachment:search_cancel.png}} Cancel selected search [[#icarus-search-index|{{attachment:navi_up.png|Back to index}}]][[#icarus-search-index|Back to index]] <> === III. Result Outline: === [[attachment:search_result_1D.png|{{attachment:search_result_1D.png|attachment:search_result_1D.png|width="400"}}]] * Aggregated result visualization depending on the number of grouping operators (dimensions) for up to three groups (3D) * Result highlighting for instances of query constraints * Fully customizable graph visualization * Easy navigation through results for up to three groups (3D) [[#icarus-search-index|{{attachment:navi_up.png|Back to index}}]][[#icarus-search-index|Back to index]] ---- <> === IV. Dependency-Search === i. [[#search-parameter-dependency|Search Parameter (Dependency-Search)]] i. [[#graph-query-editor|Graph Query Editor (Dependency-Search)]] i. [[#result-outline-dependency|Result Outline (Dependency-Search)]] [[#icarus-search-index|{{attachment:navi_up.png|Back to index}}]][[#icarus-search-index|Back to index]] <> === Search Parameter (Dependency-Search): === * '''Search-Mode:''' Non-Exhaustive (stop after first hit), Exhaustive (add each sentence to the result at most one) and Exhaustive search with Grouping * '''Direction:''' Left-To-Right or Right-To-Left * '''Case-Sensitiv:''' On/Off * '''Result Limit:''' limit the search result (number of hits) [[#icarus-search-index|{{attachment:navi_up.png|Back to index}}]][[#icarus-search-index|Back to index]] <> === Graph Query Editor (Dependency-Search): === {{attachment:search_query-editor-tab.png}} This tab is used to build a query. Graph Editor Toolbar: {{attachment:search_graph-tb.png}} * {{attachment:search_preferences.png}} = Open the preferences * {{attachment:search_arc-layout.png}} = Change the current graph layout. There are three different layout types available {{attachment:search_qe_layouts.png||align="middle"}} 1. Arc layout {{attachment:search_qe_arc-layout.png||align="middle"}} 1. No layout {{attachment:search_qe_no-layout.png||align="middle"}} 1. Tree {{attachment:search_qe_tree-layout.png||align="middle"}} * {{attachment:search_clear.png}} = Clear graph panel - every nodes/edges are deleted * {{attachment:search_export.png}} = Save the current search graph to XML file (may be imported later) * {{attachment:search_import.png}} = Import a search graph XML file * {{attachment:search_print.png}} = Print the current graph * {{attachment:search_add-node.png}} = Add a new node to the current search graph * {{attachment:search_add-disjunction.png}} = Adds a new disjunction to the current search graph * {{attachment:search_add-edge.png}} = Connects two nodes (two nodes must be selected before this action can be performed) * {{attachment:search_add-pedge.png}} = Connects two nodes with a precedence relation (two nodes must be selected before this action can be performed) * {{attachment:search_delete.png}} = Delete selected node/edge (multi selection possible) * {{attachment:search_edit.png}} = Opens the edit node/edge dialog (Instead of using this button you may doubleclick a node/edge to open the edit dialog) * {{attachment:search_clone.png}} = Duplicate (copy and insert) the selected nodes/edges. Quick way to duplicate a graph. Note: edges are only copied when their source and target node is selected. * {{attachment:search_copy_button.png}} = Copy and the selected nodes/edges. Note: edges are only copied when their source and target node is selected. (strg+c) * {{attachment:search_paste_button.png}} = Paste previously copied nodes/edges. (strg+p) * {{attachment:search_reorder-graph.png}} = Redraw the graph, can be useful while adding new nodes, edges or constraints may mess up the graph layout. Example (arc-layout): (left nodes/edges unsorted; right nodes/edges reorderes) {{attachment:search_graph-redraw.png||align="middle"}} Note: The copy&paste nodes/edges can be used to copy graphs from/into other perspectives (e.g. Tutorial 1D,..) * {{attachment:search_undo.png}} = Undo the last graph editor operation * {{attachment:search_redo.png}} = Redo the last graph editor operation * {{attachment:search_zin.png}} = Increase zoom level * {{attachment:search_zdefault.png}} = Switch back to the default zoom level * {{attachment:search_zout.png}} = Decrease zoom level * {{attachment:search_zauto.png}} = Autofit zoom level to the current graph panel size (default off) * {{attachment:search_zcompress.png}} = Compress graph (right-left). Merge node/edge information into a node. Search annotation highlight is never merged and always visible even. (default off) * {{attachment:search_toggle_c-d.png}} = If there are different (unconnected) graphs A, B the search will use the following query (A v B). '''Text Query Editor Toolbar:''' {{attachment:search_query-tb-text.png}} * {{attachment:search_undo.png}} = Undo the last text editor operation * {{attachment:search_redo.png}} = Redo the last text editor operation * {{attachment:search_copy_button.png}} = Copy and the selected text. (strg+c) * {{attachment:search_paste_button.png}} = Paste previously copied text. (strg+p) * {{attachment:search_select-all.png}} = Select the entire query text (strg+a) * {{attachment:search_clearhistory.png}} = Clear the text query panel. * {{attachment:search_save-graph-to-desc.png}} = Save query graph to the current selected search history item * {{attachment:search_sync-to-graph.png}} = Generate search graph from text query * {{attachment:search_sync-to-text.png}} = Generate text query from search graph [[#icarus-search-index|{{attachment:navi_up.png|Back to index}}]][[#icarus-search-index|Back to index]] <> === Result Outline (Dependency-Search): === {{attachment:search_result-tab.png}} Use this tab to browse the search results. The visialization may be seperated into four differnet presentation styles. We describe the different types in the following section. '''Result Outline Toolbar:''' {{attachment:search_result-base-tb.png}} * {{attachment:search_preferences.png}} = Open the preferences * {{attachment:search_result-search-desc.png}} Short query description and number of matches (here 3 grouping operators and 10 matches) * {{attachment:search_reorder-graph.png}} = Refresh the result outline * {{attachment:search_export.png}} = Save the current search result to a XML file (may be imported later) * {{attachment:search_import.png}} = Import search result XML file * {{attachment:search_clearhistory.png}} = Close the result outline * {{attachment:search_result-grouping-desc.png}} Grouping operator {{attachment:search_grouping-operator.png}} result informations. The corresponding {{attachment:search_grouping-operator.png}} color and the number of matches for each {{attachment:search_grouping-operator.png}} (ICARUS supports up to three grouping operators) (In this example we have 1. lemma- {{attachment:search_grouping-operator.png}} (red) 8 matches, 2. lemma- {{attachment:search_grouping-operator.png}} (green) 5 matches and 3. pos- {{attachment:search_grouping-operator.png}} 4 matches) '''0. No grouping operator {{attachment:search_grouping-operator.png}} is used.''' * Query: {{attachment:search_query-0D.png||align="middle"}} * Text Query: [lemma=be [relation=VC, pos=VBN]] * Result Toolbar: {{attachment:search_result-tb-0D.png}} The result is presented as a list of sentences. Every occurence that matches the query is colored blue. Results (0D) [[attachment:search_result_0D.png|{{attachment:search_result_0D.png|attachment:search_result_0D.png|width="300"}}]] '''1. One grouping operator {{attachment:search_grouping-operator.png}} is used.''' * Query: {{attachment:search_query-1D.png||align="middle"}} * Text Query: [lemma=be [relation=VC, lemma<*>1, pos=VBN]] * Result Toolbar: {{attachment:search_result-tb-1D.png}} All lemma types found are shown in the list (red) to the left. The user may select one lemma type to get all instances with matching query. Every occurence that matches the query is colored blue and the "grouped" lemma colored red. Results (1D) [[attachment:search_result_1D.png|{{attachment:search_result_1D.png|attachment:search_result_1D.png|width="300"}}]] * Options: {{attachment:search_result-option-1D.png}} * {{attachment:search_numeric_switch.png}} = Switch between numeric/percentage result numbers (total) * {{attachment:search_sort.png}} = Sort by wordform or by occurence (ascending/descending) * {{attachment:search_reset_sort.png}} = Reset list sorting '''2. Two grouping operators {{attachment:search_grouping-operator.png}} are used.''' * Query: {{attachment:search_query-2D.png||align="middle"}} * Text Query: [lemma=be [relation=VC, lemma<*>1, pos=VBN [relation=LGS, form=by [relation=PMOD, lemma<*>2]]]] * Result Toolbar: {{attachment:search_result-tb-2D.png}} The result is presented as a table. Grouping operator one (red) is on the y-axis and grouping operator two (green) on the x-axis (Note: The x-/y-axis may be fliped clicking on {{attachment:search_flip-table.png}} ). Every occurence that matches the query is colored blue. Results (2D) [[attachment:search_result_2D-a.png|{{attachment:search_result_2D-a.png|attachment:search_result_2D-a.png|width="300"}}]] [[attachment:search_result_2D-b.png|{{attachment:search_result_2D-b.png|attachment:search_result_2D-b.png|width="300"}}]] * Options: {{attachment:search_result-option-2D.png}} * {{attachment:search_numeric_switch.png}} = Switch between numeric/percentage result numbers (total) * {{attachment:search_sort.png}} = Sort y-axis by wordform or by occurence (ascending/descending) * {{attachment:search_sort_x-axis.png}} = Sort x-axis by wordform or by occurence (ascending/descending) * {{attachment:search_flip-table.png}} = Swap the x-/y-axis (e.g.: (old) x-axis = (new) y-axis and vice versa) * {{attachment:search_reset_sort.png}} = Reset table sorting '''3. Three grouping operators {{attachment:search_grouping-operator.png}} are used.''' * Query: {{attachment:search_query-3D.png||align="middle"}} * Text Query: [lemma=be [relation=VC, lemma<*>1, pos=VBN [relation=LGS, form=by [relation=PMOD, lemma<*>2]][relation=OBJ, lemma<*>3]]] * Result Toolbar: {{attachment:search_result-tb-3D.png}} The result is presented as a list of sentences. Every occurence that matches the query is colored blue. Results (3D) [[attachment:search_result_3D-a.png|{{attachment:search_result_3D-a.png|attachment:search_result_3D-a.png|height="170"}}]] [[attachment:search_result_3D-b.png|{{attachment:search_result_3D-b.png|attachment:search_result_3D-b.png|width="300"}}]] * Options: {{attachment:search_result-option-3D.png}} * {{attachment:search_numeric_switch.png}} = Switch between numeric/percentage result numbers (total) * {{attachment:search_sort.png}} = Sort by wordform or by occurence (ascending/descending) * {{attachment:search_reset_sort.png}} = Reset list sorting * {{attachment:search_sort.png}} = Sort y-axis by wordform or by occurence (ascending/descending) * {{attachment:search_sort_x-axis.png}} = Sort x-axis by wordform or by occurence (ascending/descending) * {{attachment:search_flip-table.png}} = Swap the x-/y-axis (e.g.: (old) x-axis = (new) y-axis and vice versa) * {{attachment:search_reset_sort.png}} = Reset table sorting * {{attachment:search_3D-reorder.png}} = Change the grouping operor ([0] = list, [1] = table y-axis and [2] = table x-axis). In this example {{attachment:search_3D-reorder-dialog.png||align="middle"}} we have [0] = first {{attachment:search_grouping-operator.png}} (red), [1] = second {{attachment:search_grouping-operator.png}} (green) and [3] = third {{attachment:search_grouping-operator.png}} (brown) At the lower part of the graph panel is the text outline. The list contains all search results of the selected instance. The selected sentence is shown in the graph panel. '''Toolbar:''' {{attachment:text-tb.png}} * {{attachment:text_outline.png}} = Toggle a textpanel to copy the selected sentence. (see below) * {{attachment:text_outline-on.png}} * {{attachment:item-first.png}} = First sentence * {{attachment:item-previous.png}} = Previous sentence * {{attachment:sentence-nr.png}} = Shows the current selected sentence (first number) and the total sentences (last number). In the example figure sentence 2 of 3 is selected. The user may navigate using the arrows to the left/right. It is possible to enter the sentence no. in this field by pressing "return" the sentence pops up. Note that the sentence numbers belong to the the internal index (the corpus index may differ for example if one sentence number have been skipped) * {{attachment:item-next.png}} = Next sentence * {{attachment:item-last.png}} = Last sentence [[#icarus-search-index|{{attachment:navi_up.png|Back to index}}]][[#icarus-search-index|Back to index]] ---- <> === V. Error Mining === i. [[#search-parameter-errormining|Search Parameter (Error Mining)]] i. [[#errormining-query-editor|Error Mining Query Editor]] i. [[#result-outline-errormining|Result Outline (Error Mining)]] To detect sequence annotation errors within part-of-speech tags we implemented the algorithm introduced by Dickinson and Meurers (2003) [1]. Additionally for structured annotations we choose the approach shown in Boyd et al. (2008) [2] that targets inconsistency within dependency structures. We designed and built a graphical user interface (GUI) that is easy to handle and user-friendly. Implementing state-of-the-art algorithms for error detection with an user-friendly interface increase the operation domain because the algorithms can be used by a wider audience without deeper knowledge of computers. It provides even non-expert users with the capability to find inconsistent pos tags and dependency structures within a corpus. [1] Dickinson, M. and Meurers, W. D. (2003). Detecting errors in part-of-speech annotation. In Proceedings of the 10th Conference of the European Chapter of the Association for Computational Linguistics (EACL-03), pages 107–114, Budapest, Hungary. [2] Boyd, A., Dickinson, M., and Meurers, D. (2008). On detecting errors in dependency treebanks. Research on Language and Computation, 6(2):113–137. [[#icarus-search-index|{{attachment:navi_up.png|Back to index}}]][[#icarus-search-index|Back to index]] <> === Search Parameter (Error Mining): === * '''Replace all Numbers by Special Token:''' When the ''number wildcard replacement'' filter is enabled the algorithm checks for every word-form during the error mining process if the current word is a number. This is done using a regular expression that flags all words where the first letter is a number (0...9). These words will be replaced with a special ''NumberWildcard'' token. It provides the error mining algorithm with the capability to compare strings that contain different numbers and treat them equally in order to find variation within the non-number word-forms. * '''Use Fringe Heuristic:''' The fringe heuristic is used to filter n-grams where the nucleus occurs at the start/end of the n-gram. This is useful because when the nucleus is surrounded by words the probability that we find an error is higher. * '''Maximum NGram Size (passes):''' Limit the maximum n-gram size (size = algorithm iterations). By default this parameter is zero which is equivalent to ∞ . * '''Maximum Sentences for Input:''' The sentence limitation is used to limit the number of sentences that are used for the error mining. Starting at sentence one until the specified value ''x'' is reached. For example with a limit of 10,000 at most the first 10,000 sentences of the specified corpus will be used during the error mining process. Note: Using this option has a strong influence on the results and should be used carefully, because limiting the input data may leak the variation for one word. By default this value is "0" (zero) and the engine will use all sentences of the given corpus. * '''Show only NGrams with a size of:''' Even when the fringe heuristic is enabled the results will still contain uni-/bi-grams. Using the ''Show only NGrams with a size of'' option allows the user to filter the resulting n-grams. For example if the value is set to "1", the resulting list will contain 2-, 3-, n-grams, ... . * '''Create XML Output File:''' Using the ''Output to File'' option creates an xml-formatted file. It contains information about the word-forms, tags, tag-count and highlight information. It is formatted in a human-readable way so that its possible to do error detection even without the graphical support of the error mining plug-in. (By default no outputlocation is set in the {{attachment:search_preferences.png}} and the user will be asked for the desired filelocation when the error mining task is complete) [[#icarus-search-index|{{attachment:navi_up.png|Back to index}}]][[#icarus-search-index|Back to index]] <> === Error Mining Query Editor: === {{attachment:search_query-editor-tab.png}} This tab is used to build a query. A single query item contain of the following parts: i. Include Tag (boolean) = All tags that are ignored (''Include Tag=true'') are mapped onto a special "ignoredtag"-subclass. This option has priority over the new tag definition. i. Tagclass (string) = If the current tag matches the ''Tagclass'' it may be ''included'' or assigned with a ''new Tag'' (if speficied) i. new Tag (string) = The new tag for all tags that have a matching ''Tagclass'' within the query list specified in ii.) If the current tag is not found within the query list it is neither ignored nor does it get a new tag assigned and the algorithm just continues the normal way taking the current tag. The benefit of this design is that there is no need to put the whole tag-set into the query system. The Error Mining Query Editor provides the functionality to group tags together, rename tags or exclude tags from the search. It is organized in three parts [[attachment:search_qe-errormining-view.png|{{attachment:search_qe-errormining-view.png|attachment:search_qe-errormining-view.png|width="400"}}]]. On the left side there are buttons to create/edit or delete a single query: * {{attachment:search_qe-add.png}} = Add a new ngram query item * {{attachment:search_qe-edit.png}} = Edit selected ngram query item [[attachment:search_qe-edit-ngramtag.png|{{attachment:search_qe-edit-ngramtag.png|attachment:search_qe-edit-ngramtag.png|height="100"}}]] * {{attachment:search_qe-delete.png}} = Delete selected ngram query item In the middle there is an overview over all specified queries represented as a list. [[attachment:search_qe-errormining-list.png|{{attachment:search_qe-errormining-list.png|attachment:search_qe-errormining-list.png|height="100"}}]] Below are three buttons to manage the ngram query item list: * {{attachment:search_qe-load.png}} = Load ngram query xml file * {{attachment:search_qe-save.png}} = Save all ngram query items to xml * {{attachment:search_qe-reset.png}} = Remove all ngram query items from list The capability of saving a query to an extensible mark-up file (xml) and load it again later is useful if the user specifies a query and wants to use it later in different corpora. Using reset will delete all specified query items. [[#icarus-search-index|{{attachment:navi_up.png|Back to index}}]][[#icarus-search-index|Back to index]] <> === Result Outline(Error Mining): === {{attachment:search_result-tab.png}} Use this tab to browse the search error mining results. ICARUS provides two views for browsing the potential errors. The {{attachment:search_variation-ngrams.png}} view shows a list of all variation n-grams found whereas the second view {{attachment:search_label-distribution.png}} shows label distribution over word forms. '''Result Outline Toolbar:''' {{attachment:search_result-em-tb.png}} * {{attachment:search_preferences.png}} = Open the preferences * {{attachment:search_result-search-desc.png}} Short query description and number of matches (note grouping is never used always "0" when viewing an error mining result) * {{attachment:search_reorder-graph.png}} = Refresh the result outline * {{attachment:search_export.png}} = Save the current search result to a XML file (may be imported later) * {{attachment:search_import.png}} = Import search result XML file * {{attachment:search_clearhistory.png}} = Close the result outline === Variation N-Gram View (Error Mining): === [[attachment:search_result_em-pos-variation.png|{{attachment:search_result_em-pos-variation.png|attachment:search_result_em-pos-variation.png|width="400"}}]] '''Variation N-Gram Toolbar''' {{attachment:search_em-variation-tb.png}} * {{attachment:search_preferences.png}} = Open the preferences * {{attachment:search_em_text-filter.png}} = Filter the variation n-gram list using the specified string * {{attachment:search_em_min-gram.png}} = Minimum n-gram size for items within the list * {{attachment:search_em_max-gram.png}} = Maximum n-gram size for items within the list * {{attachment:filter_apply.png}} = Apply variation n-gram filter * {{attachment:filter_reset.png}} = Reset variation n-gram list filter * {{attachment:sort_asc.png}} = Sort the n-gram ascending list by n-gram length * {{attachment:sort_desc.png}} = Sort the n-gram descending list by n-gram length Each variation entry has the following format "Listindex) n-gram-length Occurence-Count ngram" Example n-gram: {{attachment:search_em-single-result.png}} . * "1)" List Index * "1-gram" Length of the variation n-gram (here 1) * "100+" Variation n-gram occurence count. (100+ = more than 100 matches) * "'s" Every variation nucleus is colored purple When the user selects one n-gram additional information about the nucleus (part-of-speech tags, tagcount) is displayed below the list. To inspect the result the user may double click on an entry from the variation n-gram lis. In the example he would recieve all sentences with the nucleus "'s" (POS, VBZ and NNP) clicking on {{attachment:search_em-single-result.png||align="middle"}} If he is only interested in instances where "'s" was tagged as VBZ first he have to select the n-gram in the list and anfterwards double click on one of the lines in the lower part of the window {{attachment:search_em-single-tag.png||align="middle"}} that contain that particular combination of word form and part-of-speech tag. Each time the user clicks on a n-gram, a new tab will be created, allows the user to jump back to previous results without having to recreate them (run the search again). === Label Distribution View (Error Mining): === [[attachment:search_result_em-pos-distribution.png|{{attachment:search_result_em-pos-distribution.png|attachment:search_result_em-pos-distribution.png|width="400"}}]] '''Variation Label Distribution Toolbar''' {{attachment:search_em-distribution-tb.png}} * {{attachment:search_preferences.png}} = Open the preferences * {{attachment:search_em_text-filter.png}} = Filter the label distribution list using the specified string * {{attachment:filter_apply.png}} = Apply label distribution filter * {{attachment:filter_reset.png}} = Reset label distribution filter * {{attachment:search_show_ngram.png}} = Show sentences for the n-gram * {{attachment:search_em-labelsize.png}} = Specify n-gram size for label distribution * {{attachment:search_em-distribution.png}} = Generate new label distribution for specified {{attachment:search_em-labelsize.png}} n-gram size * {{attachment:search_export.png}} = Export barchart to "portable network graphics" (.png) (export settings can be configured in the preferences {{attachment:search_preferences.png}} On the left a list of unique label combinations is shown. Selecting one displays a list of word form that occur with exactly these tags in the corpus. This list is below {{attachment:search_result-em-label-dist-b.png||align="middle"}} . To the right the frequencies of the different labels are shown in a barchart. The left-most bar (here red) for each label always shows the total frequency. The user may select more words froms from the list to add additional bars to the chart that show the frequencies for eacht selected word form. === Results Presentation: === [[attachment:search_result_em-pos-distribution.png|{{attachment:search_result_em-pos-distribution.png|attachment:search_result_em-pos-distribution.png|width="400"}}]] [[#icarus-search-index|{{attachment:navi_up.png|Back to index}}]][[#icarus-search-index|Back to index]] ---- <> === VI. Tutorials (including videos) === i. [[#tutorial-1D|Tutorial Dependency Search (passive constructions) with one grouping operator]] i. [[#tutorial-2D|Tutorial Dependency Search (passive constructions with overt logical subjects)]] i. [[#tutorial-3D|Tutorial Dependency Search (passive constructions with overt logical subjects and object)]] i. [[#tutorial-em|Tutorial Error Mining]] [[#icarus-search-index|{{attachment:navi_up.png|Back to index}}]][[#icarus-search-index|Back to index]] <> === 1) Tutorial Dependency Search (passive constructions) with one grouping operator: === Video Download: * [[http://clarin01.ims.uni-stuttgart.de/icarus/tutorial_passive_1D.mp4|tutorial_passive_1D.mp4]] * [[http://clarin01.ims.uni-stuttgart.de/icarus/tutorial_passive_1D.ogv|tutorial_passive_1D.ogv]] If the the user doesn't exactly know the how passive constructions are annotated in a treebank. Then he can use e.g. mate-tools or weblicht to parse a sentence contains a passice construction and copy&paste the structure to the search graph. 1. Parsed sentence "Mary was kissed by a boy." {{attachment:search_example_mt.png}} . 1. Select the passice construction {{attachment:search_example_mt_selected.png}} 1. Copy the selected cells and edges {{attachment:search_copy.png}} and switch to the {{attachment:search_perspective.png}} 1. Paste selected cells and edges into the search query editor window {{attachment:search_paste.png}} 1. The resulting graph when using the arc-layout (recommended) {{attachment:search_arc-layout.png}} {{attachment:search_cp-graph-arc.png}} 1. In the following step the search graph (query) will be generalized (double clicking the edge / nodes to open the edge/node editor). 1. Node 1 properties {{attachment:search_edit-node.png}} changed to {{attachment:search_edit-node-b.png}} 1. Edge properties {{attachment:search_example-edge.png}} changed to {{attachment:search_example-edge-b.png}} 1. Node 2 properties {{attachment:search_example-node2.png}} changed to (added grouping operator <*>) {{attachment:search_example-node2-b.png}} {{attachment:search_example-node2-c.png}} 1. These changes result in a new more generalized version of the search graph (below is the textual query representation) {{attachment:search_example_sg+text.png|search_example_sg+text.png}} This query matches passive constructions in English as annotated in the CoNLL08 Shared Task data set. 1. Results (1D) [[attachment:search_result_1D.png|{{attachment:search_result_1D.png|attachment:search_result_1D.png|width="400"}}]] [[#icarus-search-index|{{attachment:navi_up.png|Back to index}}]][[#icarus-search-index|Back to index]] <> === 2) Tutorial Dependency Search (passive constructions with overt logical subjects): === Video Download: * [[http://clarin01.ims.uni-stuttgart.de/icarus/tutorial_passive_2D.mp4|tutorial_passive_2D.mp4]] * [[http://clarin01.ims.uni-stuttgart.de/icarus/tutorial_passive_2D.ogv|tutorial_passive_2D.ogv]] We are interested in passive constructions with overt logical subjects, grouped by lemma of the verb and the lemma of the logical subject. We may use the search graph for passive constructions or build the query completly manually (shown here). 1. First of all clear the graph editor panel (if there is any remaining graph) using {{attachment:search_clear.png}} 1. Add four new nodes {{attachment:search_add-node.png}} you may "automatic reorder" them by clicking {{attachment:search_reorder-graph.png}} 1. Your graph editor should look like {{attachment:search_t2_4nodes.png}} 1. There are two ways connecting nodes / adding edges 1. Select two nodes {{attachment:search_t2_addingedge-a.png}} and connect them clicking on {{attachment:search_add-edge.png}} 1. Place the cursor in the middle of the desired (source) node. A green border will show up {{attachment:search_hl-node.png}} . Hold the left mousebutton and move to the (target) node. When you reached the target node again a green border shows up. Release the left mousebutton to draw an edge between those node {{attachment:search_t2_addingedge-b.png}} 1. Double click on the nodes/edges to specify the constraints. (Note: Adding constraints may mess up the graph layout. You may use {{attachment:search_reorder-graph.png}} to redraw the graph) 1. Node 1: Lemma = be {{attachment:search_t2-n1.png}} 1. Node 2: Lemma = <*> (red grouping operator); Part-Of-Speech = VBN {{attachment:search_t2-n2.png}} 1. Node 3: Form = by {{attachment:search_t2-n3.png}} 1. Node 4: Lemma = <*> (green grouping operator) {{attachment:search_t2-n4.png}} 1. Edge 1: Relation = VC {{attachment:search_t2-e1.png}} 1. Edge 2: Relation = LGS {{attachment:search_t2-e2.png}} 1. Edge 3: Relation = PMOD {{attachment:search_t2-e3.png}} 1. When every node, edge was linked and there was no error setting the constraints above the search graph should look like this: {{attachment:search_t2-sg.png}} . (Textual query: [lemma=be [relation=VC, lemma<*>1, pos=VBN [relation=LGS, form=by [relation=PMOD, lemma<*>2]]]]) 1. Results (2D) [[attachment:search_result_2D-a.png|{{attachment:search_result_2D-a.png|attachment:search_result_2D-a.png|width="400"}}]] [[attachment:search_result_2D-b.png|{{attachment:search_result_2D-b.png|attachment:search_result_2D-b.png|width="400"}}]] [[#icarus-search-index|{{attachment:navi_up.png|Back to index}}]][[#icarus-search-index|Back to index]] <> === 3) Tutorial Dependency Search (passive constructions with overt logical subjects and object): === Video Download: * [[http://clarin01.ims.uni-stuttgart.de/icarus/tutorial_passive_3D.mp4|tutorial_passive_3D.mp4]] * [[http://clarin01.ims.uni-stuttgart.de/icarus/tutorial_passive_3D.ogv|tutorial_passive_3D.ogv]] In tutorial 1) we showed how to create a query using a copied graph from the parser. Tutorial 2) shows how to create a query from scratch. In tutorial 3) we will extend the search graph used in 2) with an additional grouping operator. 1. We start with the following search graph {{attachment:search_t2-sg.png}} 1. Add one new node {{attachment:search_add-node.png}} you may "automatic reorder" them by clicking {{attachment:search_reorder-graph.png}} 1. Your graph editor should look like {{attachment:search_t3-n5added.png}} 1. Connect the "red" node with the new node using one of the following options 1. Select the node {{attachment:search_t3-addedge-c.png}} and connect them clicking on {{attachment:search_add-edge.png}} 1. Place the cursor in the middle of node 2. A green border will show up {{attachment:search_t3-addedge-a.png}} . Hold the left mousebutton and move to the new node. When you reached the target node again a green border shows up {{attachment:search_hl-node.png}} . Release the left mousebutton to draw an edge between those node {{attachment:search_t3-addedge-b.png}} 1. Double click on the new node/edge to specify the constraints. (Note: Adding constraints may mess up the graph layout. You may use {{attachment:search_reorder-graph.png}} to redraw the graph) 1. Node 5: Lemma = <*> (browngrouping operator) {{attachment:search_t3-n5.png}} 1. Edge 4: Relation = OBJ {{attachment:search_t3-e4.png}} 1. When every node, edge was linked and there was no error setting the constraints above the search graph should look like this: {{attachment:search_t3-sg.png}} . (Textual query: [lemma=be [relation=VC, lemma<*>1, pos=VBN [relation=LGS, form=by [relation=PMOD, lemma<*>2]][relation=OBJ, lemma<*>3]]]) 1. Results (3D) [[attachment:search_result_3D-a.png|{{attachment:search_result_3D-a.png|attachment:search_result_3D-a.png|height="170"}}]] [[attachment:search_result_3D-b.png|{{attachment:search_result_3D-b.png|attachment:search_result_3D-b.png|width="400"}}]] [[#icarus-search-index|{{attachment:navi_up.png|Back to index}}]][[#icarus-search-index|Back to index]] <> === 4) Tutorial Error Mining: === Video Download: * [[http://clarin01.ims.uni-stuttgart.de/icarus/tutorial_errormining.mp4|tutorial_errormining.mp4]] * [[http://clarin01.ims.uni-stuttgart.de/icarus/tutorial_errormining.ogv|tutorial_errormining.ogv]] [[#icarus-search-index|{{attachment:navi_up.png|Back to index}}]][[#icarus-search-index|Back to index]]