[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Ajuda com transformação de arquivos
From: |
Jeiks |
Subject: |
Ajuda com transformação de arquivos |
Date: |
Wed, 10 Nov 2010 14:31:41 -0200 |
Olá lista,
estou com um arquivo de 4898431 linhas, que segue o seguinte padrão:
0,tcp,http,SF,219,1098,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,1,1,0.00,0.00,0.00,0.00,1.00,0.00,0.00,7,255,1.00,0.00,0.14,0.05,0.00,0.01,0.00,0.00,normal.
0,udp,domain_u,SF,30,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,2,0.00,0.00,0.00,0.00,1.00,0.00,1.00,46,9,0.20,0.11,0.20,0.00,0.00,0.00,0.00,0.00,normal.
estou convertendo os valores para organizá-los como entrada de uma rede
neural.
Bom, fiz o script abaixo, mas pelas minhas contas ele vai demorar 25h
para terminar... e eu ainda tenho outros arquivos.
Gostaria de uma ajuda para a otimização do script.
Informações:
especif.h (biblioteca de vetores indexados - necessita de bash 4.0) ->
http://pastebin.com/AMxEvSYd
organiza_fonte.sh -> http://pastebin.com/XxNrBX4y
execução: ./organiza_fonte.sh arquivo > arquivo_saida
arquivo completo de entrada:
http://kdd.ics.uci.edu/databases/kddcup99/kddcup.data.gz
10% do arquivo de entrada:
http://kdd.ics.uci.edu/databases/kddcup99/kddcup.data_10_percent.gz
obrigado a todos
--
Jacson R. C. Silva
[As partes desta mensagem que não continham texto foram removidas]