Search in sources :

Example 16 with Extractors

use of org.neo4j.csv.reader.Extractors in project neo4j by neo4j.

the class DataFactoriesTest method shouldFailForDuplicateIdHeaderEntries.

@Test
public void shouldFailForDuplicateIdHeaderEntries() throws Exception {
    // GIVEN
    CharSeeker seeker = seeker("one:id\ttwo:id");
    IdType idType = IdType.ACTUAL;
    Extractors extractors = new Extractors('\t');
    var e = assertThrows(DuplicateHeaderException.class, () -> defaultFormatNodeFileHeader().create(seeker, TABS, idType, groups));
    assertEquals(entry("one", Type.ID, extractors.long_()), e.getFirst());
    assertEquals(entry("two", Type.ID, extractors.long_()), e.getOther());
}
Also used : Extractors(org.neo4j.csv.reader.Extractors) CharSeeker(org.neo4j.csv.reader.CharSeeker) IdType(org.neo4j.internal.batchimport.input.IdType) Test(org.junit.jupiter.api.Test)

Example 17 with Extractors

use of org.neo4j.csv.reader.Extractors in project neo4j by neo4j.

the class DataFactoriesTest method shouldParseDefaultNodeFileHeaderCorrectly.

@Test
public void shouldParseDefaultNodeFileHeaderCorrectly() throws Exception {
    // GIVEN
    CharSeeker seeker = seeker("ID:ID,label-one:label,also-labels:LABEL,name,age:long,location:Point{crs:WGS-84}");
    IdType idType = IdType.STRING;
    Extractors extractors = new Extractors(',');
    // WHEN
    Header header = defaultFormatNodeFileHeader().create(seeker, COMMAS, idType, groups);
    // THEN
    assertArrayEquals(array(entry("ID", Type.ID, CsvInput.idExtractor(idType, extractors)), entry("label-one", Type.LABEL, extractors.stringArray()), entry("also-labels", Type.LABEL, extractors.stringArray()), entry("name", Type.PROPERTY, extractors.string()), entry("age", Type.PROPERTY, extractors.long_()), entry("location", Type.PROPERTY, extractors.point(), PointValue.parseHeaderInformation("{crs:WGS-84}"))), header.entries());
    seeker.close();
}
Also used : Extractors(org.neo4j.csv.reader.Extractors) CharSeeker(org.neo4j.csv.reader.CharSeeker) DataFactories.defaultFormatNodeFileHeader(org.neo4j.internal.batchimport.input.csv.DataFactories.defaultFormatNodeFileHeader) DataFactories.defaultFormatRelationshipFileHeader(org.neo4j.internal.batchimport.input.csv.DataFactories.defaultFormatRelationshipFileHeader) IdType(org.neo4j.internal.batchimport.input.IdType) Test(org.junit.jupiter.api.Test)

Example 18 with Extractors

use of org.neo4j.csv.reader.Extractors in project neo4j by neo4j.

the class DataFactoriesTest method shouldAllowMissingIdHeaderEntry.

@Test
public void shouldAllowMissingIdHeaderEntry() throws Exception {
    // GIVEN
    CharSeeker seeker = seeker("one\ttwo");
    Extractors extractors = new Extractors(';');
    // WHEN
    Header header = defaultFormatNodeFileHeader().create(seeker, TABS, IdType.ACTUAL, groups);
    // THEN
    assertArrayEquals(array(entry("one", Type.PROPERTY, extractors.string()), entry("two", Type.PROPERTY, extractors.string())), header.entries());
    seeker.close();
}
Also used : Extractors(org.neo4j.csv.reader.Extractors) CharSeeker(org.neo4j.csv.reader.CharSeeker) DataFactories.defaultFormatNodeFileHeader(org.neo4j.internal.batchimport.input.csv.DataFactories.defaultFormatNodeFileHeader) DataFactories.defaultFormatRelationshipFileHeader(org.neo4j.internal.batchimport.input.csv.DataFactories.defaultFormatRelationshipFileHeader) Test(org.junit.jupiter.api.Test)

Example 19 with Extractors

use of org.neo4j.csv.reader.Extractors in project neo4j by neo4j.

the class DataFactoriesTest method shouldParseHeaderFromFirstLineOfFirstInputFile.

@Test
public void shouldParseHeaderFromFirstLineOfFirstInputFile() throws Exception {
    // GIVEN
    final CharReadable firstSource = wrap("id:ID\tname:String\tbirth_date:long");
    final CharReadable secondSource = wrap("0\tThe node\t123456789");
    DataFactory dataFactory = DataFactories.data(value -> value, () -> new MultiReadable(Readables.iterator(IOFunctions.identity(), firstSource, secondSource)));
    Header.Factory headerFactory = defaultFormatNodeFileHeader();
    Extractors extractors = new Extractors(';');
    // WHEN
    CharSeeker seeker = CharSeekers.charSeeker(new MultiReadable(dataFactory.create(TABS).stream()), TABS, false);
    Header header = headerFactory.create(seeker, TABS, IdType.ACTUAL, groups);
    // THEN
    assertArrayEquals(array(entry("id", Type.ID, extractors.long_()), entry("name", Type.PROPERTY, extractors.string()), entry("birth_date", Type.PROPERTY, extractors.long_())), header.entries());
    seeker.close();
}
Also used : MultiReadable(org.neo4j.csv.reader.MultiReadable) Extractors(org.neo4j.csv.reader.Extractors) DataFactories.defaultFormatNodeFileHeader(org.neo4j.internal.batchimport.input.csv.DataFactories.defaultFormatNodeFileHeader) DataFactories.defaultFormatRelationshipFileHeader(org.neo4j.internal.batchimport.input.csv.DataFactories.defaultFormatRelationshipFileHeader) CharSeeker(org.neo4j.csv.reader.CharSeeker) CharReadable(org.neo4j.csv.reader.CharReadable) Test(org.junit.jupiter.api.Test)

Example 20 with Extractors

use of org.neo4j.csv.reader.Extractors in project neo4j by neo4j.

the class DataFactoriesTest method shouldFailForDuplicatePropertyHeaderEntries.

@Test
public void shouldFailForDuplicatePropertyHeaderEntries() throws Exception {
    // GIVEN
    CharSeeker seeker = seeker("one:id\tname\tname:long");
    IdType idType = IdType.ACTUAL;
    Extractors extractors = new Extractors('\t');
    var e = assertThrows(DuplicateHeaderException.class, () -> defaultFormatNodeFileHeader().create(seeker, TABS, idType, groups));
    assertEquals(entry("name", Type.PROPERTY, extractors.string()), e.getFirst());
    assertEquals(entry("name", Type.PROPERTY, extractors.long_()), e.getOther());
    seeker.close();
}
Also used : Extractors(org.neo4j.csv.reader.Extractors) CharSeeker(org.neo4j.csv.reader.CharSeeker) IdType(org.neo4j.internal.batchimport.input.IdType) Test(org.junit.jupiter.api.Test)

Aggregations

Extractors (org.neo4j.csv.reader.Extractors)22 CharSeeker (org.neo4j.csv.reader.CharSeeker)18 Test (org.junit.jupiter.api.Test)11 IdType (org.neo4j.internal.batchimport.input.IdType)10 Test (org.junit.Test)8 DataFactories.defaultFormatNodeFileHeader (org.neo4j.internal.batchimport.input.csv.DataFactories.defaultFormatNodeFileHeader)8 DataFactories.defaultFormatRelationshipFileHeader (org.neo4j.internal.batchimport.input.csv.DataFactories.defaultFormatRelationshipFileHeader)8 DataFactories.defaultFormatNodeFileHeader (org.neo4j.unsafe.impl.batchimport.input.csv.DataFactories.defaultFormatNodeFileHeader)6 IOException (java.io.IOException)3 ParallelBatchImporter (org.neo4j.internal.batchimport.ParallelBatchImporter)2 DataGeneratorInput (org.neo4j.internal.batchimport.input.DataGeneratorInput)2 Input (org.neo4j.internal.batchimport.input.Input)2 JobScheduler (org.neo4j.scheduler.JobScheduler)2 DuplicateHeaderException (org.neo4j.unsafe.impl.batchimport.input.DuplicateHeaderException)2 Reader (java.io.Reader)1 StringReader (java.io.StringReader)1 Path (java.nio.file.Path)1 Config (org.neo4j.configuration.Config)1 CharReadable (org.neo4j.csv.reader.CharReadable)1 Configuration (org.neo4j.csv.reader.Configuration)1