You are in the accessibility menu

Please use this identifier to cite or link to this item: http://acervodigital.unesp.br/handle/11449/72860
Full metadata record
DC FieldValueLanguage
dc.contributor.authorDe Andrade, Tiago Luís-
dc.contributor.authorDe Souza, Rogéria Cristiane Gratão-
dc.contributor.authorBabini, Maurizio-
dc.contributor.authorValêncio, Carlos Roberto-
dc.date.accessioned2014-05-27T11:26:14Z-
dc.date.accessioned2016-10-25T18:35:52Z-
dc.date.available2014-05-27T11:26:14Z-
dc.date.available2016-10-25T18:35:52Z-
dc.date.issued2011-12-01-
dc.identifierhttp://dx.doi.org/10.1109/PDCAT.2011.58-
dc.identifier.citationParallel and Distributed Computing, Applications and Technologies, PDCAT Proceedings, p. 299-304.-
dc.identifier.urihttp://hdl.handle.net/11449/72860-
dc.identifier.urihttp://acervodigital.unesp.br/handle/11449/72860-
dc.description.abstractAiming to ensure greater reliability and consistency of data stored in the database, the data cleaning stage is set early in the process of Knowledge Discovery in Databases (KDD) and is responsible for eliminating problems and adjust the data for the later stages, especially for the stage of data mining. Such problems occur in the instance level and schema, namely, missing values, null values, duplicate tuples, values outside the domain, among others. Several algorithms were developed to perform the cleaning step in databases, some of them were developed specifically to work with the phonetics of words, since a word can be written in different ways. Within this perspective, this work presents as original contribution an optimization of algorithm for the detection of duplicate tuples in databases through phonetic based on multithreading without the need for trained data, as well as an independent environment of language to be supported for this. © 2011 IEEE.en
dc.format.extent299-304-
dc.language.isoeng-
dc.sourceScopus-
dc.subjectAlgorithm-
dc.subjectData cleansing-
dc.subjectDuplicated tuples-
dc.subjectData cleaning-
dc.subjectKnowledge discovery in database-
dc.subjectMissing values-
dc.subjectMulti-threading-
dc.subjectNull value-
dc.subjectDatabase systems-
dc.subjectLinguistics-
dc.subjectOptimization-
dc.subjectAlgorithms-
dc.titleOptimization of algorithm to identification of duplicate tuples through similarity phonetic based on multithreadingen
dc.typeoutro-
dc.contributor.institutionUniversidade Estadual Paulista (UNESP)-
dc.description.affiliationDepto. de Ciências de Computação e Estatística Universidade Estadual Paulista - Unesp, São José do Rio Preto-
dc.description.affiliationDepartamento de Letras Modernas Universidade Estadual Paulista - Unesp, São José do Rio Preto-
dc.description.affiliationUnespDepto. de Ciências de Computação e Estatística Universidade Estadual Paulista - Unesp, São José do Rio Preto-
dc.description.affiliationUnespDepartamento de Letras Modernas Universidade Estadual Paulista - Unesp, São José do Rio Preto-
dc.identifier.doi10.1109/PDCAT.2011.58-
dc.rights.accessRightsAcesso restrito-
dc.relation.ispartofParallel and Distributed Computing, Applications and Technologies, PDCAT Proceedings-
dc.identifier.scopus2-s2.0-84856660893-
Appears in Collections:Artigos, TCCs, Teses e Dissertações da Unesp

There are no files associated with this item.
 

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.