Import a ReaderBench output file (.csv) and GAMET output file (.csv), and merge the two files on ID.
Source:R/file_utilities.R
import_merge_gamet_rb.RdImport a ReaderBench output file (.csv) and GAMET output file (.csv), and merge the two files on ID.
Value
A base data.frame created by joining the ReaderBench and GAMET tables
by ID, with one row per matched ID and the following columns:
ID(character): identifier present in both sources.All retained ReaderBench feature columns (
numeric).All retained GAMET error/category columns (
numeric).
By default, only IDs present in both inputs are kept (inner join). If a
feature name appears in both sources, standard merge suffixes (e.g.,
.x/.y) may be applied by the join implementation.
The object has class data.frame (or tibble if converted by the user).
Examples
# Example with package sample data
rb_path <- system.file("extdata", "sample_rb.csv", package = "writeAlizer")
gam_path <- system.file("extdata", "sample_gamet.csv", package = "writeAlizer")
rb_gam <- import_merge_gamet_rb(rb_path, gam_path)
head(rb_gam)
#> [1] ID
#> [2] error_count
#> [3] word_count
#> [4] grammar
#> [5] misspelling
#> [6] duplication
#> [7] typographical
#> [8] whitespace
#> [9] per_gram
#> [10] per_spell
#> [11] Paragraphs
#> [12] Sentences
#> [13] Words
#> [14] Content.words
#> [15] RB.RdbltyFlesch
#> [16] RB.RdbltyFog
#> [17] RB.RdbltyKincaid
#> [18] RB.RdbltyDaleChall
#> [19] RB.AvgBlLen
#> [20] RB.AvgCommaBl
#> [21] RB.AvgCommaSen
#> [22] RB.AvgSenLen
#> [23] RB.AvgSenBl
#> [24] RB.AvgUnqWdBl
#> [25] RB.AvgUnqWdSen
#> [26] RB.AvgWdLen
#> [27] RB.AvgWdBl
#> [28] RB.AvgWdSen
#> [29] RB.CharEnt
#> [30] RB.BlStDevSen
#> [31] RB.BlStDevUnq
#> [32] RB.SenStDevUnqWd
#> [33] RB.BlStdDevWd
#> [34] RB.SenStdDevWd
#> [35] RB.WdEnt
#> [36] RB.WdLettStdDev
#> [37] RB.LxcDiv
#> [38] RB.LxcSoph
#> [39] RB.SynDiv
#> [40] RB.SynSoph
#> [41] RB.CAF
#> [42] RB.AvgNounBl
#> [43] RB.AvgPronounBl
#> [44] RB.AvgVerbBl
#> [45] RB.AvgAdverbBl
#> [46] RB.AvgAdjectiveBl
#> [47] RB.AvgPrepositionBl
#> [48] RB.AvgNounSen
#> [49] RB.AvgPronounSen
#> [50] RB.AvgVerbSen
#> [51] RB.AvgAdverbSen
#> [52] RB.AvgAdjectiveSen
#> [53] RB.AvgPrepositionSen
#> [54] RB.AvgUnqNoundBl
#> [55] RB.AvgUnqPronounBl
#> [56] RB.AvgUnqVerbBl
#> [57] RB.AvgUnqAdverbBl
#> [58] RB.AvgUnqAdjectiveBl
#> [59] RB.AvgUnqPrepositionBl
#> [60] RB.AvgPronBl_first_person
#> [61] RB.AggPronSen_first_person
#> [62] RB.AvgPronBl_indefinite
#> [63] RB.AggPronSen_indefinite
#> [64] RB.AvgPronBl_interrogative
#> [65] RB.AggPronSen_interrogative
#> [66] RB.AvgPronBl_second_person
#> [67] RB.AggPronSen_second_person
#> [68] RB.AvgPronBl_third_person
#> [69] RB.AggPronSen_third_person
#> [70] RB.AvgSemDep
#> [71] RB.WdDiffLemmaStem
#> [72] RB.WdDiffWdStem
#> [73] RB.WdMaxDpthHypernymTree
#> [74] RB.WdAvgDpthHypernymTree
#> [75] RB.WdPathCntHypernymTree
#> [76] RB.WdPolysemyCnt
#> [77] RB.WdSylCnt
#> [78] RB.AvgAOADoc_Shock
#> [79] RB.AvgAOABl_Shock
#> [80] RB.AvgAOASen_Shock
#> [81] RB.AvgAOADoc_Cortese
#> [82] RB.AvgAOABl_Cortese
#> [83] RB.AvgAOASen_Cortese
#> [84] RB.AvgAOADoc_Kuperman
#> [85] RB.AvgAOABl_Kuperman
#> [86] RB.AvgAOASen_Kuperman
#> [87] RB.AvgAOADoc_Bird
#> [88] RB.AvgAOABl_Bird
#> [89] RB.AvgAOASen_Bird
#> [90] RB.AvgAOADoc_Bristol
#> [91] RB.AvgAOABl_Bristol
#> [92] RB.AvgAOASen_Bristol
#> [93] RB.AvgAOEDoc_IndexPolynomialFitAboveThreshold.0.3.
#> [94] RB.AvgAOEBl_IndexPolynomialFitAboveThreshold.0.3.
#> [95] RB.AvgAOESen_IndexPolynomialFitAboveThreshold.0.3.
#> [96] RB.AvgAOEDoc_InverseLinearRegressionSlope
#> [97] RB.AvgAOEBl_InverseLinearRegressionSlope
#> [98] RB.AvgAOESen_InverseLinearRegressionSlope
#> [99] RB.AvgAOEDoc_InflectionPointPolynomial
#> [100] RB.AvgAOEBl_InflectionPointPolynomial
#> [101] RB.AvgAOESen_InflectionPointPolynomial
#> [102] RB.AvgAOEDoc_InverseAverage
#> [103] RB.AvgAOEBl_InverseAverage
#> [104] RB.AvgAOESen_InverseAverage
#> [105] RB.AvgAOEDoc_IndexAboveThreshold.0.3.
#> [106] RB.AvgAOEBl_IndexAboveThreshold.0.3.
#> [107] RB.AvgAOESen_IndexAboveThreshold.0.3.
#> [108] RB.AvgNmdEntBl
#> [109] RB.AvgNounNmdEntBl
#> [110] RB.AvgUnqNmdEntBl
#> [111] RB.AvgNmdEntSen
#> [112] RB.TCorefChainDoc
#> [113] RB.AvgCorefChain
#> [114] RB.AvgChainSpan
#> [115] RB.AvgInferenceDistChain
#> [116] RB.TActCorefChainWd
#> [117] RB.TCorefChainBigSpan
#> [118] RB.AvgConnBl_addition
#> [119] RB.AvgConnSen_addition
#> [120] RB.AvgConnBl_complex_subordinators
#> [121] RB.AvgConnSen_complex_subordinators
#> [122] RB.AvgConnBl_concessions
#> [123] RB.AvgConnSen_concessions
#> [124] RB.AvgConnBl_conditions
#> [125] RB.AvgConnSen_conditions
#> [126] RB.AvgConnBl_conjunctions
#> [127] RB.AvgConnSen_conjunctions
#> [128] RB.AvgConnBl_conjuncts
#> [129] RB.AvgConnSen_conjuncts
#> [130] RB.AvgConnBl_contrasts
#> [131] RB.AvgConnSen_contrasts
#> [132] RB.AvgConnBl_coordinating_conjuncts
#> [133] RB.AvgConnSen_coordinating_conjuncts
#> [134] RB.AvgConnBl_coordinating_connectives
#> [135] RB.AvgConnSen_coordinating_connectives
#> [136] RB.AvgConnBl_disjunctions
#> [137] RB.AvgConnSen_disjunctions
#> [138] RB.AvgConnBl_logical_connectors
#> [139] RB.AvgConnSen_logical_connectors
#> [140] RB.AvgConnBl_oppositions
#> [141] RB.AvgConnSen_oppositions
#> [142] RB.AvgConnBl_order
#> [143] RB.AvgConnSen_order
#> [144] RB.AvgConnBl_quasi_coordinators
#> [145] RB.AvgConnSen_quasi_coordinators
#> [146] RB.AvgConnBl_reason_and_purpose
#> [147] RB.AvgConnSen_reason_and_purpose
#> [148] RB.AvgConnBl_reference
#> [149] RB.AvgConnSen_reference
#> [150] RB.AvgConnBl_semi_coordinators
#> [151] RB.AvgConnSen_semi_coordinators
#> [152] RB.AvgConnBl_sentence_linking
#> [153] RB.AvgConnSen_sentence_linking
#> [154] RB.AvgConnBl_simple_subordinators
#> [155] RB.AvgConnSen_simple_subordinators
#> [156] RB.AvgConnBl_temporal_connectors
#> [157] RB.AvgConnSen_temporal_connectors
#> [158] RB.LexChainAvgSpan
#> [159] RB.LexChainMaxSp
#> [160] RB.AvgLexChain
#> [161] RB.PercLexChainCoverage
#> [162] RB.AvgBlScore
#> [163] RB.AvgSenScore
#> [164] RB.BlScoreStDev
#> [165] RB.SenScoreStDev
#> [166] RB.AvgBlAdjCoh_LeackockChodorow
#> [167] RB.AvgBlDocCoh_LeackockChodorow
#> [168] RB.AvgInterBlCoh_LeackockChodorow
#> [169] RB.AvgIntraBlCoh_LeackockChodorow
#> [170] RB.AvgMidEndCoh_LeackockChodorow
#> [171] RB.AvgSenAdjCoh_LeackockChodorow
#> [172] RB.AvgSenBlCoh_LeackockChodorow
#> [173] RB.AvgStartMidCoh_LeackockChodorow
#> [174] RB.AvgTransCoh_LeackockChodorow
#> [175] RB.AvgStartEndCoh_LeackockChodorow
#> [176] RB.DocFlAbsPosAcc_LeackockChodorow_MaxVal
#> [177] RB.DocFlAbsDistAcc_LeackockChodorow_MaxVal
#> [178] RB.DocFlAdjAcc_LeackockChodorow_MaxVal
#> [179] RB.DocFlAvgCoh_LeackockChodorow_MaxVal
#> [180] RB.DocFlMaxOrdSeq_LeackockChodorow_MaxVal
#> [181] RB.DocFlSpearmamCorr_LeackockChodorow_MaxVal
#> [182] RB.DocFlAbsPosAcc_LeackockChodorow_AbvMeanStdev
#> [183] RB.DocFlAbsDistAcc_LeackockChodorow_AbvMeanStdev
#> [184] RB.DocFlAdjAcc_LeackockChodorow_AbvMeanStdev
#> [185] RB.DocFlAvgCoh_LeackockChodorow_AbvMeanStdev
#> [186] RB.DocFlMaxOrdSeq_LeackockChodorow_AbvMeanStdev
#> [187] RB.DocFlSpearmamCorr_LeackockChodorow_AbvMeanStdev
#> [188] RB.AvgBlAdjCoh_WuPalmer
#> [189] RB.AvgBlDocCoh_WuPalmer
#> [190] RB.AvgInterBlCoh_WuPalmer
#> [191] RB.AvgIntraBlCoh_WuPalmer
#> [192] RB.AvgMidEndCoh_WuPalmer
#> [193] RB.AvgSenAdjCoh_WuPalmer
#> [194] RB.AvgSenBlCoh_WuPalmer
#> [195] RB.AvgStartMidCoh_WuPalmer
#> [196] RB.AvgTransCoh_WuPalmer
#> [197] RB.AvgStartEndCoh_WuPalmer
#> [198] RB.DocFlAbsPosAcc_WuPalmer_MaxVal
#> [199] RB.DocFlAbsDistAcc_WuPalmer_MaxVal
#> [200] RB.DocFlAdjAcc_WuPalmer_MaxVal
#> [201] RB.DocFlAvgCoh_WuPalmer_MaxVal
#> [202] RB.DocFlMaxOrdSeq_WuPalmer_MaxVal
#> [203] RB.DocFlSpearmamCorr_WuPalmer_MaxVal
#> [204] RB.DocFlAbsPosAcc_WuPalmer_AbvMeanStdev
#> [205] RB.DocFlAbsDistAcc_WuPalmer_AbvMeanStdev
#> [206] RB.DocFlAdjAcc_WuPalmer_AbvMeanStdev
#> [207] RB.DocFlAvgCoh_WuPalmer_AbvMeanStdev
#> [208] RB.DocFlMaxOrdSeq_WuPalmer_AbvMeanStdev
#> [209] RB.DocFlSpearmamCorr_WuPalmer_AbvMeanStdev
#> [210] RB.AvgBlAdjCoh_Path
#> [211] RB.AvgBlDocCoh_Path
#> [212] RB.AvgInterBlCoh_Path
#> [213] RB.AvgIntraBlCoh_Path
#> [214] RB.AvgMidEndCoh_Path
#> [215] RB.AvgSenAdjCoh_Path
#> [216] RB.AvgSenBlCoh_Path
#> [217] RB.AvgStartMidCoh_Path
#> [218] RB.AvgTransCoh_Path
#> [219] RB.AvgStartEndCoh_Path
#> [220] RB.DocFlAbsPosAcc_Path_MaxVal
#> [221] RB.DocFlAbsDistAcc_Path_MaxVal
#> [222] RB.DocFlAdjAcc_Path_MaxVal
#> [223] RB.DocFlAvgCoh_Path_MaxVal
#> [224] RB.DocFlMaxOrdSeq_Path_MaxVal
#> [225] RB.DocFlSpearmamCorr_Path_MaxVal
#> [226] RB.DocFlAbsPosAcc_Path_AbvMeanStdev
#> [227] RB.DocFlAbsDistAcc_Path_AbvMeanStdev
#> [228] RB.DocFlAdjAcc_Path_AbvMeanStdev
#> [229] RB.DocFlAvgCoh_Path_AbvMeanStdev
#> [230] RB.DocFlMaxOrdSeq_Path_AbvMeanStdev
#> [231] RB.DocFlSpearmamCorr_Path_AbvMeanStdev
#> [232] RB.AvgBlAdjCoh_LSA
#> [233] RB.AvgBlDocCoh_LSA
#> [234] RB.AvgInterBlCoh_LSA
#> [235] RB.AvgIntraBlCoh_LSA
#> [236] RB.AvgMidEndCoh_LSA
#> [237] RB.AvgSenAdjCoh_LSA
#> [238] RB.AvgSenBlCoh_LSA
#> [239] RB.AvgStartMidCoh_LSA
#> [240] RB.AvgTransCoh_LSA
#> [241] RB.AvgStartEndCoh_LSA
#> [242] RB.DocFlAbsPosAcc_LSA_MaxVal
#> [243] RB.DocFlAbsDistAcc_LSA_MaxVal
#> [244] RB.DocFlAdjAcc_LSA_MaxVal
#> [245] RB.DocFlAvgCoh_LSA_MaxVal
#> [246] RB.DocFlMaxOrdSeq_LSA_MaxVal
#> [247] RB.DocFlSpearmamCorr_LSA_MaxVal
#> [248] RB.DocFlAbsPosAcc_LSA_AbvMeanStdev
#> [249] RB.DocFlAbsDistAcc_LSA_AbvMeanStdev
#> [250] RB.DocFlAdjAcc_LSA_AbvMeanStdev
#> [251] RB.DocFlAvgCoh_LSA_AbvMeanStdev
#> [252] RB.DocFlMaxOrdSeq_LSA_AbvMeanStdev
#> [253] RB.DocFlSpearmamCorr_LSA_AbvMeanStdev
#> [254] RB.AvgBlAdjCoh_LDA
#> [255] RB.AvgBlDocCoh_LDA
#> [256] RB.AvgInterBlCoh_LDA
#> [257] RB.AvgIntraBlCoh_LDA
#> [258] RB.AvgMidEndCoh_LDA
#> [259] RB.AvgSenAdjCoh_LDA
#> [260] RB.AvgSenBlCoh_LDA
#> [261] RB.AvgStartMidCoh_LDA
#> [262] RB.AvgTransCoh_LDA
#> [263] RB.AvgStartEndCoh_LDA
#> [264] RB.DocFlAbsPosAcc_LDA_MaxVal
#> [265] RB.DocFlAbsDistAcc_LDA_MaxVal
#> [266] RB.DocFlAdjAcc_LDA_MaxVal
#> [267] RB.DocFlAvgCoh_LDA_MaxVal
#> [268] RB.DocFlMaxOrdSeq_LDA_MaxVal
#> [269] RB.DocFlSpearmamCorr_LDA_MaxVal
#> [270] RB.DocFlAbsPosAcc_LDA_AbvMeanStdev
#> [271] RB.DocFlAbsDistAcc_LDA_AbvMeanStdev
#> [272] RB.DocFlAdjAcc_LDA_AbvMeanStdev
#> [273] RB.DocFlAvgCoh_LDA_AbvMeanStdev
#> [274] RB.DocFlMaxOrdSeq_LDA_AbvMeanStdev
#> [275] RB.DocFlSpearmamCorr_LDA_AbvMeanStdev
#> [276] RB.AvgBlAdjCoh_word2vec
#> [277] RB.AvgBlDocCoh_word2vec
#> [278] RB.AvgInterBlCoh_word2vec
#> [279] RB.AvgIntraBlCoh_word2vec
#> [280] RB.AvgMidEndCoh_word2vec
#> [281] RB.AvgSenAdjCoh_word2vec
#> [282] RB.AvgSenBlCoh_word2vec
#> [283] RB.AvgStartMidCoh_word2vec
#> [284] RB.AvgTransCoh_word2vec
#> [285] RB.AvgStartEndCoh_word2vec
#> [286] RB.DocFlAbsPosAcc_word2vec_MaxVal
#> [287] RB.DocFlAbsDistAcc_word2vec_MaxVal
#> [288] RB.DocFlAdjAcc_word2vec_MaxVal
#> [289] RB.DocFlAvgCoh_word2vec_MaxVal
#> [290] RB.DocFlMaxOrdSeq_word2vec_MaxVal
#> [291] RB.DocFlSpearmamCorr_word2vec_MaxVal
#> [292] RB.DocFlAbsPosAcc_word2vec_AbvMeanStdev
#> [293] RB.DocFlAbsDistAcc_word2vec_AbvMeanStdev
#> [294] RB.DocFlAdjAcc_word2vec_AbvMeanStdev
#> [295] RB.DocFlAvgCoh_word2vec_AbvMeanStdev
#> [296] RB.DocFlMaxOrdSeq_word2vec_AbvMeanStdev
#> [297] RB.DocFlSpearmamCorr_word2vec_AbvMeanStdev
#> [298] RB.AvgBlVoiceCoOcc
#> [299] RB.AvgSenVoiceCoOcc
#> [300] RB.BlVoiceCoOccStDev
#> [301] RB.SenVoiceCoOccStDev
#> [302] RB.AvgBlVoiceCumEff
#> [303] RB.AvgSenVoiceCumEff
#> [304] RB.BlVoiceCumEffStDev
#> [305] RB.SenVoiceCumEffStDev
#> [306] RB.AvgBlVoiceMI
#> [307] RB.AvgSenVoiceMI
#> [308] RB.BlVoiceMIStDev
#> [309] RB.SenVoiceMIStDev
#> [310] RB.AvgVoice
#> [311] RB.AvgVoiceBlDist
#> [312] RB.AvgVoiceBlEnt
#> [313] RB.AvgVoiceReccBl
#> [314] RB.AvgVoiceReccSen
#> [315] RB.AvgVoiceSenDist
#> [316] RB.AvgVoiceSenEnt
#> [317] RB.VoiceBlDistStDev
#> [318] RB.VoiceReccBlDistStDev
#> [319] RB.VoiceReccSenStDev
#> [320] RB.VoiceSenDistStDev
#> [321] RB.VoiceAvgSpan
#> [322] RB.VoiceMaxSpan
#> [323] RB.AvgSenSyll
#> [324] RB.AvgSenStressedSyll
#> [325] RB.AvgRhythmUnits
#> [326] RB.AvgRhythmUnitSyll
#> [327] RB.AvgRhythmUnitStreesSyll
#> [328] RB.LangRhythmCoeff
#> [329] RB.LangRhythmId
#> [330] RB.FrqRhythmId
#> [331] RB.LangRhythmDiameter
#> [332] RB.SenAllit
#> [333] RB.SenAsson
#> [334] RB.AvgDepsBl_acl
#> [335] RB.AvgDepsSen_acl
#> [336] RB.AvgDepsBl_advcl
#> [337] RB.AvgDepsSen_advcl
#> [338] RB.AvgDepsBl_advmod
#> [339] RB.AvgDepsSen_advmod
#> [340] RB.AvgDepsBl_amod
#> [341] RB.AvgDepsSen_amod
#> [342] RB.AvgDepsBl_appos
#> [343] RB.AvgDepsSen_appos
#> [344] RB.AvgDepsBl_aux
#> [345] RB.AvgDepsSen_aux
#> [346] RB.AvgDepsBl_auxpass
#> [347] RB.AvgDepsSen_auxpass
#> [348] RB.AvgDepsBl_case
#> [349] RB.AvgDepsSen_case
#> [350] RB.AvgDepsBl_cc
#> [351] RB.AvgDepsSen_cc
#> [352] RB.AvgDepsBl_ccomp
#> [353] RB.AvgDepsSen_ccomp
#> [354] RB.AvgDepsBl_compound
#> [355] RB.AvgDepsSen_compound
#> [356] RB.AvgDepsBl_conj
#> [357] RB.AvgDepsSen_conj
#> [358] RB.AvgDepsBl_cop
#> [359] RB.AvgDepsSen_cop
#> [360] RB.AvgDepsBl_csubj
#> [361] RB.AvgDepsSen_csubj
#> [362] RB.AvgDepsBl_csubjpass
#> [363] RB.AvgDepsSen_csubjpass
#> [364] RB.AvgDepsBl_dep
#> [365] RB.AvgDepsSen_dep
#> [366] RB.AvgDepsBl_det
#> [367] RB.AvgDepsSen_det
#> [368] RB.AvgDepsBl_discourse
#> [369] RB.AvgDepsSen_discourse
#> [370] RB.AvgDepsBl_dislocated
#> [371] RB.AvgDepsSen_dislocated
#> [372] RB.AvgDepsBl_dobj
#> [373] RB.AvgDepsSen_dobj
#> [374] RB.AvgDepsBl_expl
#> [375] RB.AvgDepsSen_expl
#> [376] RB.AvgDepsBl_foreign
#> [377] RB.AvgDepsSen_foreign
#> [378] RB.AvgDepsBl_goeswith
#> [379] RB.AvgDepsSen_goeswith
#> [380] RB.AvgDepsBl_iobj
#> [381] RB.AvgDepsSen_iobj
#> [382] RB.AvgDepsBl_list
#> [383] RB.AvgDepsSen_list
#> [384] RB.AvgDepsBl_mark
#> [385] RB.AvgDepsSen_mark
#> [386] RB.AvgDepsBl_mwe
#> [387] RB.AvgDepsSen_mwe
#> [388] RB.AvgDepsBl_name
#> [389] RB.AvgDepsSen_name
#> [390] RB.AvgDepsBl_neg
#> [391] RB.AvgDepsSen_neg
#> [392] RB.AvgDepsBl_nmod
#> [393] RB.AvgDepsSen_nmod
#> [394] RB.AvgDepsBl_nsubj
#> [395] RB.AvgDepsSen_nsubj
#> [396] RB.AvgDepsBl_nsubjpass
#> [397] RB.AvgDepsSen_nsubjpass
#> [398] RB.AvgDepsBl_nummod
#> [399] RB.AvgDepsSen_nummod
#> [400] RB.AvgDepsBl_parataxis
#> [401] RB.AvgDepsSen_parataxis
#> [402] RB.AvgDepsBl_punct
#> [403] RB.AvgDepsSen_punct
#> [404] RB.AvgDepsBl_remnant
#> [405] RB.AvgDepsSen_remnant
#> [406] RB.AvgDepsBl_reparandum
#> [407] RB.AvgDepsSen_reparandum
#> [408] RB.AvgDepsBl_root
#> [409] RB.AvgDepsSen_root
#> [410] RB.AvgDepsBl_vocative
#> [411] RB.AvgDepsSen_vocative
#> [412] RB.AvgDepsBl_xcomp
#> [413] RB.AvgDepsSen_xcomp
#> <0 rows> (or 0-length row.names)